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For systems with long-range interactions, the two-body potential decays at large distances as 
V(r) ~ l/r Q , with a < d, where d is the space dimension. Examples are: gravitational systems, 
two-dimensional hydrodynamics, two-dimensional elasticity, charged and dipolar systems. Although 
such systems can be made extensive, they are intrinsically non additive: the sum of the energies 
of macroscopic subsystems is not equal to the energy of the whole system. Moreover, the space of 
accessible macroscopic thermodynamic parameters might be non convex. The violation of these two 
basic properties of the thermodynamics of short-range systems is at the origin of ensemble inequiv- 
alence. In turn, this inequivalence implies that specific heat can be negative in the microcanonical 
ensemble, temperature jumps can appear at microcanonical first order phase transitions. The lack 
of convexity allows us to easily spot regions of parameters space where ergodicity may be broken. 
Historically, negative specific heat had been found for gravitational systems and was thought to be 
a specific property of a system for which the existence of standard equilibrium statistical mechanics 
itself was doubted. Realizing that such properties may be present for a wider class of systems has 
renewed the interest in long-range interactions. Here, we present a comprehensive review of the re- 
cent advances on the statistical mechanics and out-of-equilibrium dynamics of solvable systems with 
long-range interactions. The core of the review consists in the detailed presentation of the concept 
of ensemble inequivalence, as exemplified by the exact solution, in the microcanonical and canonical 
ensembles, of mean-field type models. Remarkably, the entropy of all these models can be obtained 
using the method of large deviations. Long-range interacting systems display an extremely slow 
relaxation towards thermodynamic equilibrium and, what is more striking, the convergence towards 
quasi-stationary states. The understanding of such unusual relaxation process is obtained by the 
introduction of an appropriate kinetic theory based on the Vlasov equation. A statistical approach, 
founded on a variational principle introduced by Lynden-Bell, is shown to explain qualitatively and 
quantitatively some features of quasi-stationary states. Generalizations to models with both short 
and long-range interactions, and to models with weakly decaying interactions, show the robustness 
of the effects obtained for mean-field models. 

PACS numbers: 

05.20.-y Classical statistical mechanics 

05.20.Dd Kinetic theory 

05.20.Gg Classical ensemble theory 

64.60.Bd General theory of phase transitions 

64.60.De Statistical mechanics of model systems 
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I. INTRODUCTION 



A wide range of problems in physics concerns systems with long-range interactions. H owev er, their statistical and 
dynamical properties are much less understood than tho se of short-range syst ems [73l 1 1 28| . O ne finds e xamples 
of long- rang e interacting syst ems in astrophysics [97], l259l | , plasma physics [l45| , hydrodynamics [245l . l274l | , atomic 
physics [2861 ] . nuclear physics |ll4| . This ubiquitous presence in different physics disciplines alone would itself justify 
the need for a general and interdisciplinary understanding of the physical and mathematical problems raised by 
long-range interacting systems. 

In this review, we are interested in systems with a large number TV of degrees of freedom, for which a statistical 
physics approach is mandatory, independently of the specific features of the interactions. Therefore, we will discuss 
in the follo wing; equilibrium and out-of-equilibrium properties of long-range systems relying on the tools of statistical 
mechanics Il83i. 




Let us define which is the property of the interaction that makes it short or long-ranged. We consider systems where 
the interaction potential is given by the sum, over pairs of the elementary constituents, of a two-body translationally 
invariant potential. For sufficiently large distances r, the absolute value of the two-body potential is bounded by 
r~ a . If the positive power a is larger than the dimension d of the space where the system is embedded, a > d, we 
define the system to be short-range. Otherwise, if a < d, the system is long-range. The reason for this definition is 
that, in the large N limit all the mathematical and physical differences between short and long-range systems can be 
traced back to this property of the interaction potential. We should remark that this definition of the range of the 
interaction does not coincide with others, where the range is instead defined by a characteristic length appearing in 
the interaction potential. In this latter definition, any interaction decaying as a power law at large distances, thus 
without characteristic length, is considered as long range. However, for the physical and mathematical problems found 
in the statistical mechanics of many-body systems, the definition used throughout this review is more appropriate, 
since, as we have stressed, it is related to the interaction property that determines the behavior of such systems for 
large N. 

Our purpose will be to illustrate, especially through the use of simple models, the peculiar properties of long-range 
systems, and the tools and techniques that are employed to describe them. To give a flavour of the issues that will be 
considered, we would like to begin with a very simple description of the main problems that one faces in the study of 
these systems. 

The aim of equilibrium stat istica l mechanics is to derive the thermodynamic properties of a macroscopic system 
from microscopic interactions [l83j . The connection between micro and macro is realized through the introduction 
of statistical ensembles. Different thermodynamic potentials describe situations in which different thermodynamic 
parameters are used in the characterization of the system, and, in the aforementioned connection with the microscopic 
interactions, different statistical ensembles are related to different thermodynamic potentials. However, it is usually 
stated (and experimentally verified for many physical systems) that, as far as macroscopic averages are concerned, 
i.e. in the thermodynamic limit (N — > oo, V — > oo with N/V = const.), the predictions of statistical mechanics do 
not depend on the chosen ensemble. Ensemble equivalence is related to the fact that, given a sufficient number of 
macroscopic thermodynamic parameters (two in a one-component system), the others are fixed in the large volume 
limit, apart from vanishingly small relative fluctuations. 
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An important feature of long-range systems is that ensembles can be inequivalent [42|, [9?], 1 1771 . 12051 |297| . and 
therefore one of the main issues in the statistical mechanics of these systems is a careful examination of the relations 
between the different ensembles, in particular of the conditions that determine their equivalence or inequivalence. We 
emphasize that ensemble inequivalence is not merely a mathematical drawback, but it is the cause, as it will be shown 
in this review, of physical properties of these systems that could be experimentally verified. Probably, one of the 
most stri king features of long-range sy stems is the possibility to display negative specific heat in the microcanonical 
ensemble [III Il39l 1 1461 . Il77l 12341 . |297| . Specific heat is always positive in the canonical ensemble, independently of 
the nature of the interactions, since it is given by the expectation value of a positive quantity. It turns out that 
microcanonical equilibrim contains all informations about canonical equilibrium, while the converse is wrong in case 
of ensemble inequivalence [l44l . Il65| | . The discrepancy between the two ensembles extends to other observables related 
to the response of the system to a change in a thermodynamic parameter: a concrete example will be given discussing 
magnetic susceptibility. 

As far as out-of-equilibrium dynamical properties are concerned, many-body long-range systems again show peculiar 
behaviors. The approach to equilibrium of short-range systems is usually characterized by the time scales that govern 
the equations of motion of the elementary constituents [2^ |. For systems without disorder, these time scales are 
typically small when compared to the observational time scales. Sometimes these systems can be trapped in metastablc 
states that last for long times. These states are local extrema of thermodynamic potentials and, in practical cases, 
their realization requires a very careful preparation of the system (e.g. undercooled liquids and superheated solids). 
If perturbed, the system rapidly converges towards the equilibrium state. It can also happen that the relaxation time 
depends on the volume if hydrodynamics modes are present or in coarsening processes. 

For long-range systems, dynamics can be extremely slow and the approach to equilibrium can take a very long 
time, that increases with the number N of elementary constituents [47] . This feature is induced by the long-range 
nature of the interaction itself and is not a consequenc e of t he existence o f a collective phenomenon. The state of 
the system during this long transient is quasi-stationary [l09l . |217| . |230| . l316l | , since its very slow time evolution allows 
us to define slowly varying macroscopic observables, like for local equilibrium or quasi-static transformations. It 
should be however remarked that quasi-stationary states are not thermodynamic metastable states, since they do not 
lie on local extrema of equilibrium thermodynamic potentials. The explanation of their widespread presence should 
rely only upon the dynamical properties of the systems. It must be stressed that the nature of quasi-stationary 
states can be strongly dependent on the initial condition, as it will become clear from the examples that we will 
give. In addition, a variety of macroscopic structures can form spontaneously in out-of-equilibrium conditions for 
isolated systems: a fact that should not be a surprise given that already the equilibrium states of long-range systems 
are usually inhomogeneous. In short-range systems, macroscopic structures can a rise as an effect of an external 
forcing (Rayleigh-Benard convection, Benjamin- Feir instability, Faraday waves) [l52j | or due to the nonlinearity of the 
governing equations of motion (solitons, breathers) [l27l |. but are usually strongly selected by the specific dynamical 
properties and by the geometrical conditions. All this shows the great richness of the dynamics of long-range systems. 

Summarizing, a satisfying theoretical framework concerning the behaviour of system with long-range interactions 
should necessarily address the following aspects: 

• For what concerns equilibrium properties, the determination of the physical conditions that determine equiva- 
lence or inequivalence of the statistical ensembles and, for the latter case, the relation between macrostates in 
the different statistical ensembles. 

• As for the out-of-equilibrium features, the development of a consistent kinetic theory able to explain the 
formation of quasi-stationary states and their final relaxation to equilibrium. 

Although different aspects of systems with long-range interactions have been studied in the past in specific scientific 
communities, notably astrophysics and plasma physics, this has not constituted a seed for more general theoretical 
studies. In the last decade or so, it has become progressiv ely clear that the ubiquitous presence of long-range forces 
needs an approach that integrates different methodologies [TH, [l28] . This has induced a widespread interest in long- 
range systems throughout numerous research groups. The successive development has led to a better understanding 
of both the equilibrium and out-of-equilibrium properties of such systems. Time is ripe to summarize what is known 
on firm basis about the equilibrium statistical mechanics of systems with long-range interactions and to describe the 
preliminaries of a theory of non equilibrium. 

In this review, we have chosen to present the main problems, tools and solutions by discussing paradigmatic 
examples, which are simple and general enough to be useful also for specific applications. Therefore, we will mostly 
emphasize: 

• the equilibrium statistical mechanics solutions of simple mean-field toy models, which is a first step for under- 
standing phase diagrams of more complex long-range systems; 
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• the basic ingredients of a kinetic theory of a model able to catch the essential properties of out-of-equilibrium 
dynamics. 

• the analysis of the phase diagram of models with both short and long-range interactions, or with forces that 
weakly decay in space. 

The structure of the review is the following. In section|TTl we introduce the subject presenting the definition of long- 
range interactions and discussing the non additivity property and its consequences. In section Hill we briefly review 
the relevant physical systems with long-range interactions: gravitational systems, 2D hydrodynamics, 2D elasticity, 
charged systems, dipolar systems and small systems. Section [IV] constitutes the core of the review. We present there 
the equilibrium properties of several mean-field models for which one can analytically compute both the free energy 
and the entropy. The solution is obtained by different methods, including the powerful large deviation method, which 
allows to solve models with continuous variables. Dynamics is tackled in section [V] where the main result reviewed 
is the widespread presence of quasi-stationary states that hinder relaxation to Boltzmann-Gibbs equilibrium. This 
phenomenon is strongly supported by numerical simulations and can be studied using kinetic equations explicitly 
devised for long-range interactions (Vlasov, Lenard-Balescu) . Few analytical results exist for non mean-field models: 
we collect some of them in section I VII The message is that the introduction of short-range terms doesn't spoil the 
features of mean-field models and that some weakly decaying interactions can be treated rigorously. Finally, we draw 
conclusions and we discuss some perspectives in section fVIIl 



II. THE ADDITIVITY PROPERTY AND THE DEFINITION OF LONG-RANGE SYSTEMS 



A. Extensivity vs. Additivity 

In order to easily illustrate the issues of extensivity and additivity, it is useful to consider a concrete example and 
restrict ourselves to the energy as the thermodynamic extensive variable. We employ a very simple model that is used 
in the study of magnetic systems, namely the Curie- Weiss Hamiltonian 

N / N \ 2 

i,j=l \i=l / 

where the spins Si = ±1 are attached to sites labeled by i = 1, . . . , N. In this example, the interaction does not decay 
at all with the distance: indeed, each spin interacts with equal strength with all the other spins. Such systems are 
usually referred to as mean-field systems. With the 1/N prcfactor in (JTJ) , the total energy increases as N, then the 
energy per spin converges to a finite value in the thermodynamic limit, which is a physically reas onab le requirement. 
We recall that one can find rigorous definitions of the thermodynamic limit in Ruelle's book |278j |. Model ([l} is 
extensive: for a given intensive magnetization 

N JV ' ( ' 

where M is the extensive magnetization, if one doubles the number of spins the energy doubles. On the other hand, 
Hamiltonian (TTJ) is not additive, in spite of the presence of the regularizing factor 1/N. Indeed, let us divide the 
system, schematically pictured in Fig. [TJ in two equal parts. In addition, let us consider the particular case in which 
all spins in the left part are equal to +1, whereas all spins in the right part are equal to —1. The energy of the 
two parts is E\ = E 2 = — JN/4. However, if one computes the total energy of the system, one gets E = 0. Since 
E =/= Ei + E%, such a system is not additive, at least for this configuration. One could easily generalize the argument 
to generic configurations. The problem is not solved by increasing system size N, because the "interaction" energy 
E li2 = E-E 1 -E 2 = JN/2 increases with N. 

Extensivity in the Curie- Weiss model is provided by t he 1/ N prefactor in Hamiltonian (fT]) , which makes the energy 
proportional to N. This is the so-called Kac prescription [19l| . This energy rescaling guarantees a competition between 
energy E and entropy S, which is crucial for phase transitions. Indeed, introducing the free energy F = E — TS, 
where T is temperature, Kac prescription implies that the two competing terms on the r.h.s. both scale as N, since 
temperature is intensive and entropy scales like N. This latter scaling deserves a further analysis which will be 
developed later on. Intuitively, the fact that the interaction is long-range does not alter the density of states, which 
usually grows factorially with TV. An alternative prescription would be to make temperature extensive T — > TN, in 
such a way that energy and entropy term of the free energy both scale as N 2 . The two prescriptions give equivalent 
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FIG. 1: Schematic picture of a system separated in two equal parts with N/2 spins up in domain 1 and N/2 spins down in 
domain 2. 

physical consequences. In systems with kinetic energy, being temperature the average kinetic energy per particle, 
rescaling temperature corresponds to a renormalization of velocities, and finally of the time scale. 

After having defined the two distinct concepts of extensivity and additivity with reference to a specific model, let 
us clarify more generally these two notions. 

Indeed, it is very important, as a first step, to give the general definitions of extensivity and additivity, and to 
clarify the distinction between these two concepts. It is convenient to first consider the situation that one encounters 
in short-range systems. We can imagine to divide a system at equilibrium in two parts occupying equal volumes. 
Some thermodynamic variables of each half of the system will be equal to the corresponding ones of the total system, 
others will be halved. Temperature and pressure are examples of the first kind of thermodynamic variables: they do 
not depend on the size of the system and are called intensive variables. Energy, entropy and free energy are variables 
of the second kind; their value is proportional to system size, i.e. to the number of elementary constituents (for 
given values of the intensive variables), and they are called extensive variables. The property that the size dependent 
thermodynamic variables are proportional to system size is called extensivity, and systems with this property are 
called extensive. The specific value of extensive variables (e.g., the energy per unit particle, or per unit mass, or per 
unit volume) give rise to new intensive quantities. Considering the energy of a system, we see that it has also the 
property of additivity, that consists in the following. Dividing the systems in two macroscopic parts, the total energy 
E will be equal to E\ + E2 + Ei nt , with Ei the energy of the i-th part, and Ei nt the interaction energy between the two 
parts. In the thermodynamic limit the ratio Ei n t/{E\ + E2) tends to zero; therefore in this limit E E\ + -£2- This 
property is called additivity; systems with this property (for the energy as well as for other size dependent quantities) 
are called additive. It is evident that extensivity and additivity are related. Indeed, in the definition of extensivity 
just given, we could not have concluded that the energy of each part of the system is half the total energy, if the 
interaction energy E int would not be negligible. Additivity implies extensivity (thus non extensivity implies non 
additivity), but not the reverse, since the interaction energy might scale with N, as in the Curie- Weiss model. This 
comment applies more generally to all long-range systems. The unusual properties of these systems derive from the 
lack of additivity. 

B. Definition of long-range systems 

We have just shown that mean-field systems like the Curie- Weiss model can be made extensive using Kac's trick. 
However, extensive systems could be non additive. Here, we will discuss the case of interactions that decay as a power 
law at large distances. We will show that the energy e of a particle (excluding self-energy) diverges if the potential 
does not decay sufficiently fast, implying that the total energy grows super linearly with volume at constant density, 
which violates extensivity. These interactions are called long-range or non integrable, just referring to this divergence 
of the energy. Energy convergence can be restored by an appropriate generalization of Kac's trick. 

Let us estimate the energy e by considering a given particle placed at the center of a sphere of radius R where the 
other particles are homogeneously distributed. We will exclude the contribution to e coming from the particles located 
in a small neighborhood of radius S (see Fig. [5]). This is motivated by the necessity to regularize the divergence of 
the potential at small distances, which has nothing to do with its long-range nature. 

If the other particles interact with the given one via a potential that at large distances decays like l/r a , we obtain 
in d-dimensions 

e = [ R d <V p— = pJQ d ( R r'-^dr = ^ \R d ~ a - S d ~ a ] , if a ± d , (3) 

Js ra J s d-a 

where p is the generic density (e.g. mass, charge), J is the coupling constant and £ld is the angular volume in dimension 
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FIG. 2: Schematic picture of the domain considered for the evaluation of the energy e of a particle. It is a spherical shell of 
outer radius R and inner radius S. 

d (2w in d = 2, 4n in d — 3, etc.). When increasing the radius R, the energy e remains finite if and only if a > d. This 
implies that the total energy E will increase linearly with the volume V, i.e. the system is extensive. Such interactions 
are the usual short-range ones. On the contrary, if a < d the energy e grows with volume as V 1 ~ a ^ d (logarithmically 
in the marginal case a — d). This implies that the total energy E will increase superlinearly, E oc y 2 ~ a / d ^ with 
volume. However, analogously to the Kac's prescription, one can redefine the coupling constant J — > JV a / d ~ 1 and 
get a perfectly extensive system. Mean- field models, like Hamiltonian (JTJ) , correspond to the value a = 0, since the 
interaction does not depend on the distance, and one recovers the usual Kac's rescaling. Cases where the energy grows 
superlinearly define the long-range nature of the interaction. However, as we have shown for mean-field systems, the 
fact that energy can be made extensive, does not imply that the system is additive. Implications of the lack of 
additivity for long-range systems will be discussed throughout the paper. 

C. Convexity in thermodynamic parameters 



E 




M 

' ► 

FIG. 3: The set of accessible macrostates in the (M, E) space can have a non-convex shape for systems with long-range 
interactions, such that if {M\,E\) and (M2,E2) can be realized macroscopically, this is not necessarily true for all the states 
joining these two along the straight dashed line. 

Let us consider the space of the extensive thermodynamic parameters. To be specific, as in Fig. [3J we consider the 
(E, M) plane of energy and magnetization The attainable region in this space is always convex when only short- 
range interactions are present. This property is a direct consequence of additivity. Consider two different subsystems 
with two different energies E\ and E2, and two different magnetization values Mi and Mi- Introducing a parameter 
A taking values between and 1, depending on the relative size of the subsystems, the system obtained by combining 
the two subsystems has an energy E = \E\ + (1 — X)E2, and a magnetization M = XMi + (1 — A)M2. Any value 
of A between and 1 is realized thermodynamically, just varying the relative size of the two subsystems. This is 
exactly the convexity property of the attainable region of the space of thermodynamic parameters. In particular, one 
can consider two subsystems with the same energy but different magnetization. Varying the relative size of the two 
subsystems, the combined system will have the same energy and any possible magnetization between the two values 
of the two subsystems. It is important to stress that the convexity property is possible if additivity is satisfied, since 
the interaction energy between the two subsystems has been neglected. As already remarked, the additivity property 
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is generically valid for large enough and short-range interacting systems. Moreover convexity implies that the space 
of thermodynamic parameters is connected. 

On the contrary, systems with long-range interactions are not additive, and thus intermediate values of extensive 
parameters are not necessarily accessible (see Fig. This feature has profound consequences on the dynamics 
of systems with long-range interactions. Gaps may open up in the space of extensive variables. Since the space 
of thermodynamic parameters is no more connected, ergodicity breaking might appear when considering continuous 
microcanonical dynamics of such a system. We will discuss again this question in detail in Scc. lIVE4l bv emphasizing 
simple examples. 

D. Lattice systems 

When defining the long-range nature of the interaction, care must be taken of the specific nature of the microscopic 
variables. These could be divided in two classes: the coordinates related to the translational degrees of freedom (e.g. 
cartesian coordinates), and those giving the internal state of each particle (e.g. spin variables). Both could be cither 
continuous or discrete. When coordinates take fixed discrete values, one speaks of lattice systems. As for the internal 
degrees of freedom the variables could be discrete (e.g. spin systems) or continuous (e.g. systems of rotators). 

For systems with continuous translational degrees of freedom that do not possess internal degrees of freedom, the 
potential energy can be written in the general form 

u(r l7 ...,r N )= J2 m^i-^-l). ( 4 ) 

l<i<j<N 

where ( r i, . . . , r jy) are cartesian coordinates in d-dimensional space and we assume that the translationally invariant 
pair potential V depends only on the modulus of the distance between two particles. Systems of gravitational point 
masses or Coulomb point charges fall into this category. 

In a lattice system, for which each site i of a (/-dimensional lattice, located by the position vector r,, hosts a particle 
possessing one or more internal degrees of freedom, collectively denoted by the vector (the dimensionality of this 
vector is independent of d), the potential energy can be written as 

N 

[/(qx, . . . , q N ) = C H V (<H> <fe) + 9 E V ^ > ( 5 ) 

l<i<j<N t=l 

where the coupling constants Cij are translationally and rotationally invariant (i.e., they depend only on \~r*i — ~r*j\). 
We allow also the presence of an external field that couples to the particles via the function V e (q) with a strength g. 
As pointed out above, the variables may take continuous or discrete values. 

We have already discussed in the previous Subsection that long-range systems of type (0} are those for which, at 
large distance, V(r) ~ r~ a with a < d. Similarly, long-range lattice systems (0 can be characterized by a slow 
decay of the coupling constants Cij. If these latter behave at large distance like | r , — r j\~ a with a < d, the system 
is long-range. The energy grows also in this case superlinearly with the volume and one will need to introduce a 
Kac rescaling factor. At variance with systems of type ^ we need not worry about the behavior at short distances, 
because the lattice regularizes any possible divergence. 

A kinetic energy term can be added to the potential energy one for systems of type ^ and for lattice systems ([5]) 
when the variables are continuous. 

In this review, we will mostly concentrate our attention on lattice systems. Since they cannot display any short 
distance singularity, their thermodynamic and dynamical behaviour highlights the essential features of long-range 
interactions. Therefore, these systems are more suitable to present an overview of the main results and of the tools 
used to deal with long-range interactions. 

E. Non additivity and the canonical ensemble 

Let us recall that, in d = 3, neglecting the dependence on the number of particles N and on the volume V, the 
microcanonical partition function (proportional to the number of microstates with a given energy E) is defined as 



Q(E)~ J d 3N qd 3N p 5(E-H(p,q)), 



(6) 
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where (p, q) are the phase space coordinates, H is the Hamiltonian and we forget for the moment about multiplicative 
A^-dependent factors and dimensional constants (for a more precise definition see Section |IV| . The entropy is defined 
via the classical Boltzmann formula 



S(E) =lnfi(B) 



(7) 



where we adopt units for which the Boltzmann constant fcs is equal to 1. 

The non additivity has strong consequences on the construction of the canonical ensemble from the microcanonical. 
The reasoning usually goes as follows. One considers an isolated system with energy E, that we divide into a "small" 
part with energy E\ (the subsystem of interest) and a "large" part with energy E 2 which plays the role of the bath. 
The additivity of the energy implies that the probability distribution that the "small" system has an energy E\, let 
us call it p(Ei), is given by 



p(Ei) = J n 2 (E 2 ) 5{E l +E 2 - E)dE 2 



= n^E-E,) . 

Using the entropy to express fl 2 and expanding the term S 2 (E — Ei), one gets 



p(Ei) = exp [S 2 (E — Ei)] 



exp 



S 2 (E)-E 1 



0S 2 
dE 



where 



oc Q 2 (E) e 







-/3Ei 

dS 2 
dE 



(8) 
(9) 

(10) 
(11) 
(12) 

(13) 



One would end up with the usual canonical distribution for the system of interest after performing the thermodynamic 
limit [278]. Let us however remark that this derivation is valid also before t aking the thermodynamic limit. This has 
led many authors to develop a thermodynamic formalism for finite systems [1 141 ] . 

It is clear that additivity is crucial to justify the factorization hypothesis implied in ([8]), and hence the existence of the 
canonical distribution. This hypothesis is clearly violated when the system size is finite, because of a contribution to the 
entropy coming from the surface that separates the two subsystems. However, after performing the thermodynamic 
limit, this contribution becomes negligible when the interactions are short-range. This is not true for long-range 
interactions which arc non additive also in the thermodynamic limit. This has led the community to split into two 
different attitudes. On one side are those who think that the canonical ensemble cannot be appropriately defined 
for long-r ange interactions and claim that all the analyses should be performed using the microcanonical ensemble 

[l65l . Il66| ] . Although clearly appropriate for isolated system, this approach cannot describe ope n system s. O n th e 

other side are those who stress that the canonical ensemble could be still formally defined and used [9J, |204| . |259| . |26Q| ] . 
Among the two sides are those who attempt operative definitions of a heat bath. For instance, one has imagined that 
the heat bath is a short-range system that interacts with the system of interest with short-range interactions, so that 
the non additivity is due only to the system [24|, [H[ . The failure of the usual derivation of the canonical ensemble 
suggests th at n o n additiv e systems m ight have a very peculiar behavior if they arc in contact with a thermal reservoir 
(see Refs. j233l |267| . l270l l27ll 13061 ] for recent literature on this topic). A third position was recently initiated by 
Bouchet and Barre who proposed that when considering systems with long-range interactio n, th e canonical ensemble 
does not describe fluctuations of a small part of the whole system. However, they argued [60l . l309| that it may describe 
fluctuations of the whole system when coupled to a thermostat via a negligibly small coupling. This interesting line 
of thought needs to be pursued theoretically and confirmed by numerical simulations. 

In the following Section, we will discuss some physical examples before proceeding to the core of the review. 



III. PHYSICAL EXAMPLES OF LONG-RANGE INTERACTING SYSTEMS 



There are many systems in nature where particles interact with a pair potential that decays at large distances as 
V(r) ~ r~ a with a < d. Although these systems are deeply studied in their own sake (e.g. gravitational many body 
systems, Coulomb systems, systems of vortices, etc.), they rarely appear in books of statistical mechanics. This is of 
course due to the difficulty to deal with systems that are non extensive and non additive. We give here a brief sketch 
of the physics involved in some of these systems. 

In Fig. 21 we draw in the (a, d) plane some of the physical systems with long-range interactions that we will discuss 
afterwards. 
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FIG. 4: Location of some physical systems in the plane where the abscissa is space dimension d and the ordinate is the exponent 
a characterizing the spatial decay at large distances of the pair potential. 



A. Gravitational systems 



Gravitational systems, which correspond to a = 1 in dimension d = 3, clearly belong to the category of long-range 
interacting systems. The gravitational problem is particularly difficult because, in addition to the non additivity 
due to the long-range character of the interaction, one also needs a careful regularization of the potential at short 
distances to avoid collapse. To be more specific, let us consider the canonical partition function of a system of N 
self-gravitating particles of the same mass m moving inside a volume V 



N 

1 / i 

N 



N 

/ n d ^ cx p ^i, ■ • ■ - ~?n)\ , (14) 



with 



(27rA 2 ) 3A 7 2 iV! Jyf^ 
where 

JV 

u{-? 1 ,...,r N ) = -g m 2 Y J v(\n-r 3 \) , (15) 

i<j 

V(r) = - , (16) 
r 

where (3 is the inverse temperature, Q the gravitational constant, and A = K{j3 /m) 1 / 2 the De Broglie wavelength. In 
the following, we will not introduce dimensional constants, i.e. K, in the definition of partition functions for a matter 
of convenience (see e.g. Eq. (J53J) ) . From the shape of the potential represented in Fig. [5] by the solid line, one clearly 
sees that Zn will diverge if at least two particles collapse towards the same point. This difficulty arises because the 
potential is not bounded from b elow as for the Lennard-Jones or Morse potential. In quantum mechanics, the collapse 
of self-gravitating fermions [97l . ll78j . a system which is physically relevant for dwarfs and neutron stars, is forbidden 
by the Pauli exclusion principle, that introduces a natural effective small scale cut-off. However, to avoid the use 
of quantum concepts and stick to a classical model, the usual trick is to introduce an ad-hoc cut-off [47| . One way 
of regularizing the potential is shown in Fig. [5] by the dotted line. One can imagine that the potential has a hard 
core, which represents the particles' size. As a consequence, the inequality U(r) > —Gm 2 C = —C allows to easily 
determine a finite upper bound of the configurational partition function 
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FIG. 5: The gravitational potential as a function of the distance r is represented by the solid curve. The dotted curve shows 
the short-distance regularized potential which avoids gravitational collapse, —C is the lower bound of the potential. 



As far as the micro-canonical ensemble is concerned, a standard argument tells us that the entropy of a self- 
gravitating system in a finite volume might have convergence problems if the potential is not regularized at short 
distances. Indeed, let us consider N gravitating particles grouped together in a finite volume. A strong decrease of 
the potential energy of a pair of particles — Qm 2 /{\ ~rt — rj |) is obtained when n tends to Tj. Since the total energy 
is a conserved quantity, the kinetic energy would correspondingly increase. This process leads to an increase of the 
accessible phase volume in the direction of momentum, and hence to an entropy increase. Since the process extends 
to the limit of zero distance among the particles, it might induce a divergence of the density of states, and then of 
Bol tzma nn entropy. However, one can show that this does not happens for N = 2 and a more subtle derivation 
[3 l259l | reveals that the entropy integral diverges only when N > 3. It's remarkable that this also corresponds to 
the transition from an integrable (N = 2) to a non integrable (N = 3) gravitational system. As a consequence of this 
discussion, "equilibrium" states can exist only in association with local entropy maxima [l9l , l23fjj | . 

Besides these unusual properties of the canonical and microcanonical partition functions, self-gravitating systems 
were historically the first physical system for which ensemble inequivalence was discovered through the phenomenon 
of negative specific he at. The possibility of fin ding a negative specific heat in gravitational systems was already 
emphasized by Emden 1 1461 a nd Eddington |X 3Qj ] . The divergence of the phase space volume for gravitational systems 
was prov ed by Antonov [1J|, and the fact that this implies negative specific heats was later stressed by Lyndcn- 
Bell [234j . An early remark on the possib ility of having a negative specific heat can be found in the semin al review 
paper on statistical mechanics by Maxwell [240j | . This was for long time considered a paradox, until Thirring |l77t l297| 
finally clarified the controversial point by showing that the paradox disappears if one realizes that the microcanonical 
specific heat can be negative only in the microcanonical ensemble. Therefore, one can attribute to Thirring the 
discovery of ensemble inequivalence. 

Let's rephrase Thirring's argument. In the canonical ensemble, the mean value of the energy is computed from the 
partition function (|14| as 

It is then straightforward to compute the heat capacity at constant volume 

C v = ^=^((E-(E)j 2 )>0. (19) 

Let's remind that ks = 1- This clearly shows that the canonical specific heat is always positive. Notice also that this 
condition is true for systems of any size, regardless of whether a proper thermodynamic limit exists or not. 

For self-gravitating systems at constant energy (i.e., in the microcanonical ensembl e) a simple physical argument 
which justifies the presence of a negative specific heat has been given by Lynden-Bell |23l| . It is based on the virial 
theorem, which, for the gravitational potential, states that 

2{K) + (U) = 0, (20) 

where K and U are the kinetic and potential energy, respectively. Recalling that the total energy E is constant 

E=(K) + (U) = -(K), (21) 
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where in the second identity we have used the virial theorem (|20|) . and since the kinetic energy K defines the 
temperature, one gets 

dE dE 

Cv =OT*OK <0 - (22) 

Loosing its energy, the system becomes hotter. 

There is a further difficulty in the case of gravitational interactions: the system is open, i.e. without boundary, 
strictly speaking. Therefore, the microcanonical partition function ([6|) will diverge. This divergence is actually not 
peculiar of self-gravitating systems since it would also occur for a perfect gas. Although for gases it is natural to 
confine them in a box, this is completely unjustified for gravitational syste ms. A way out from this pitfall is to 
consider an expanding universe and reconsider the problem in a wider context |26dl |. 

A detailed discussion on phase transitions in self-gravitating syste ms, i n both the canonical and microcanonical 
ensemble, is not the aim of this review, and can be found in Refs. [9?1 l259l |. 



B. Two-dimensional hydrodynamics 



Two-dimensional incompressible hydrodynamics is another important case where long-range interactions appear. 
Although high Reynolds number flows have a very large number of degrees of freedom, one can often identify structures 
in the flow. This suggests that one could use a much smaller number of effective degrees of freedom to characterize 
the flow. This remark is particularly valid for two-dimensional flows, where the inverse energy cascade leads to 
the irreversible formation of large coherent structures (e.g. vortices). A system with a large number of degrees 
of freedom which can be characterized by a small number of effective parameters, is reminiscent of what happens 
in thermodynamics [86| . where a few macroscopic variables describe the behavior of s ystem s composed of many 
particles. Stat istic al mechanics for turbulence is nowadays a very active field of research [1481 ]. initiated long ago by 
Lars Onsager [257| |. We will discuss in the following the long-range character of two-dimensional hydrodynamics. 

The velocity of a two-dimensional flow can be expressed in terms of the stream function ip(r), where r = (x,y) 
is the coordinate on the plane 

v x = + ^ (23) 
dy 



The vorticity u> is related to the velocity field 



and, hence, to the stream function by the Poisson equation 

lu = -Atp . (26) 
Using the Green's function G ( r , r '), one easily finds the solution of the Poisson equation in a given domain D 

il){~r)= I Al^' u{~r") G (T*, T* 7 ) , (27) 

plus surface terms Q . In an infinite domain, 



D 



G{~r,l?') = — -In IT" - ~r'\. (28) 



The energy is conserved for the Eulcr equation and is given by 



D 



ijr±(vl+v 2 y ) (29) 

d^l (V^) 2 (30) 

D 1 

\j w(r^(T>) (31) 

= / / d7 > d7 > 'a;(7 > X7 > )ln|7 > -7 > '| . (32) 

4tt .IdJd 
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This emphasizes that one gets a logarithmic interaction between vorticies at distant locations, which corresponds to a 
decay with an effective exponent a = 0, well within the case of long-range interactions (see Fig.QJ. For a finite domain 
D the Green's function contains additional surface terms which however gives no contribution to the energy ([29]) 
if the velocity field is tangent to the boundary of the domain (no outflow or inflow). 
Another important conserved quantity is enstrophy, defined as 

A = \J D &r[u J {r)] 2 . (33) 

The long-range character of the interaction is even more evident if one approximates the vorticity field by N point 
vortices located at ~r \ = (xj, yi), with a given circulation I\ 

N 

u>C?)=Y, T i 6 (~?-^i) ■ ( 34 ) 



The energy of the system reads now 



&3 



where we have dropped the self-energy term because, although singular, it would not induce any motion 238]. 
Considering the two coordinates of the point vortex on the plane, the equations of motion are 

The phase-space volume contained inside the energy shell H = E can be written as 

N 

he) = / n d?>< e{ * E - h( ?i> ^ 

2 — 1 

8 being the Heaviside step function. The total phase space volume is $(oo) = A N , where A is the area of the domain 
D. One immediately realizes that $>(E) is a non-negative increasing function of the energy E with limits $(— oo) = 
and $(oo) = A . Therefore, its derivative, which is nothing but the microcanonical partition function ©, is given 
by 

r N 

Q(E) = &(E) = / II dT> * S ( E - H (?i> -^n)), (39) 

J i=l 

and is a non-negative function going to zero at both extremes f2(±oo) = 0. Thus the function must achieve at least one 
maximum at some finite value E m where D!{E m ) = 0. For energies E > E m , Cl'(E) will then be negative. Using the 
entropy S(E) = \nil(E), one thus gets that the inverse temperature dS/ dE is negative for E > E m . This argument 
for the existence of negative temper ature s was proposed by Onsager |257j two years before the experiment on nuclear 
spin systems by Purcell and Pound [269] reported the presence of negative "spin temperatures" . 

Onsager also pointed out that negative temperatures could lead to the formation of large-scale vortices by clustering 
of smaller ones. Although, as anticipated, the canonical distribution has to be used with cautions for long-range inter- 
acting systems, the statistical tendency of vortices of the same circulation sign to cluster in the negative temperature 
regime can be justified using the canonical distribution exp(— j3H). Changing the sign of the inverse temperatures 
(3 would be equivalent to reversing the sign of the interaction between vortices, making them repel (resp. attract) if 
they are of opposite (resp. same) circulation sign. 

After a long period in which Onsager's statistical theory was not further explored, this domain of re searc h has 
made an impressive progress recently. One might in particular cite the work of Joyce and Montgomery |l90| . who 
have considered a system of vortices with total zero circulation. Maximizing the entropy at fixed energy, they obtain 
an equation for the stream function which gives exact stable stationary solutions of the 2D Euler equations, able 
to descr ibe t he macroscopic vortex formation proposed by Onsager for negative temperatures. Later, Lundgren and 
Pointin [229j ] studied the effect of far vorticity field on the motion of a single vortex, showing that it produces a positive 
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eddy viscosity term leading to an increase of cluster size. Subsequently, Robert [274| and Miller [245l ] have elaborated 
an equilibrium statis tical mechanic al theory directly for the continuum 2D Eulcr equation. Nice pro l ongations alo ng 
these lines are Refs. jl09ll243l . l276| . together with applications to the Great Red Spot of Jupiter [58l. l63l . fl08l l303l | . 

The canonical ensemble can also be defined for both the Euler equation in 2D and the Onsa ger point vortex m odel. 
It turns out that in both cases, canonical and microcanical ensembles may be inequivalent [69l . 1 1421 . 1 1431 . |205| ]. In 
particular for the Euler equation in the region of negative temperature (where vorticity tends to accumulate at the 
center of the d omain) some hybrid states are shown to be realized in the microcanonical ensemble but not in the 
canonical [l42| . As far as the point vortex model is co ncern ed, microcanonical stable states, that are unstable in the 
canonical ensemble, are found for specific geometries [205ll . In both cases, microcanonical specific heat is negative. 
Reviews on 2D turbulence can be found in Refs [U Ill8l . |287| . l292j . Another r elate d interesting case of ensemble 
inequivalence has been recently reported in the context of physical oceanography [30Sl | . 



C. Two-dimensional elasticity 

Let us discuss the planar stress and displacement fields around the tip of a slit-like plane crack in an ideal Hookcan 
continuum solid. The classical approach to a linear elasticity problem of this sort involves the search for a suitable 
"stress function" that satisfies the so-called biharmonic equation 

V 2 (V 2 t/0 = (40) 

where ip is the Airy stress function, which has to satisfy appropriate boundary conditions. The deformation energy 
density is then defined as U ex ere where a is the fracture stress field around the t ip, whereas e is the deformation 
field. Considering a crack- width a and using the exact Muskhelishvili's solution |249l |. one obtains the elastic potential 
energy due to the crack 



where E is the Young modulus, Coo the stress field at infinity, v the Poisson coefficient and r the distance to the 
tip. The elasticity equation in the bulk of two-dimensional materials leads therefore to a marginal case of long-range 
interaction, since U ~ 1/r 2 in d = 2. Looking at the important engineering applications, the dynamics of this non 
conservative system should be better studied: the difficulty lies again in the long-range nature of the interaction. In 
addition, in such a two dimensional material, the presence of several fractures could exhibit a very interesting new 
type of screening effects. 



D. Charged systems 

The partition function of system of charges is basically the same as the one for gravitational systems displayed in 
formula (TH")) with the potential given by 

1 N 

£/( r\, . . . , = _ £ e t e 3 V(\7t - 7]\) , (42) 

i<j 

where a = ±e is the charge of the particle and V is given by formula (|16[) . If the total charge is non zero, the excess 
charge is expelled to the boundary of the domain and the bulk is neutral. Hence, neglecti ng bound ary contributions 
(which are non extensive), one usually considers an infinite medium with total zero charge |220l . l226j ] . Similarly to the 
gravitational case, the partition function would diverge if not properly regularized at short distances. This is usually 
done by supposing that additional forces are present at short distances, either hard core of radius A or smoothed 
Coulombic singularities of the type 

1 - exp(-r/A) 

V smooth ~ • (4oJ 

r 

The regularized partition function allows one to perform the thermodynamic limit and derive physical quantities, like 
the pressure. 

As far as the large distance behavior is concerned, several rigorous results exist [67| that prove, under appropriate 
hypotheses (low density and high temperature), that the effective two-body potential is Debye-Hiickel screened 

cxp(-r/£ D ) 

V ^f Z ' \ U > 
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where £o = (eo/(2ne 2 /3)) 1 / 2 is the Debye length, with n the density. Among the hypotheses, the most important one 
for its possible physical consequences is the one of low density or high temperature. This suggests that all pathologies 
related to ensemble inequivalence (e.g. negative specific heat), that is a consequence of the long-range nature of the 
interaction, will be indeed absent for charged systems. 

An interesting different situ ation is the one of plasmas consisting exclusively of single charged particles (pure 
electron or pure ion plasmas) |l34l |. Charged particles are confined by external electric and magnetic fields. The 
thermodynamics of such systems is not affected by short-distance effects because particles repel each other. On the 
contrary, the behavior at large distances is different from the one of glob ally neutral low density plasmas. Nevertheless, 
by studying the correlation at equilibrium, a Debye length emerges |l34{ (this behavior is also present in self-gravitating 
systems). Experimentally, relaxation to thermal equilibrium has been observed only in some specific conditions. The 
system has been sho wn to relax into quasi-stationary states, including minimum enstrophy states (see Eq. Q33p ) and 
vortex crystal states jl54l Il84| . Indeed, in the particular case of a pure electron plasma in a cylindrical container 
with a strong magnetic field applied along the axis of the cylinder, it can be shown that the electron motion in the 
plane perpendicular to the magnetic field obeys equations that are the same as the Onsager point vortex model (]36|) 
and (|37|) . Therefore, all what has been written above about two-dimensional hydrodynamics applies to this system, 
including the exist ence of negative specific heat which has been explicitly demonstrated in a magnetically self-confined 
plasma torus [206l ] . 



E. Dipolar systems 



Systems of electric and magnetic dipoles share many similarities, but also have important differences (force in 
non uniform external fields, the symmetry axial/polar of the dipolar vector). However, for what concerns statistical 
mechanics, the two systems are equivalent. Magnetic dipolar systems are easier to realize in nature; let us then 
concentrate on them. The interaction energy between two magnetic dipoles is 



3 4tt 



where fi , is the magnetic moment, r . 



(45) 

( r j — ~r \) is the distance between the two dipoles and /zo is the mag- 
netic permeability of the vacuum. Dipolar interaction energy is strongly anisotropic: on a lattice, magnetic moments 
parallel to a bond interact ferromagnetically, while when they are perpendicular to the bond they interact antifcr- 
romagnetically. When magnetic moments are placed on a triangular or a square lattice the interaction is frustrated. 
Dipolar forces are long-range only in d = 3, because the energy per spin scales as Jd d r/r 3 . They are therefore 
marginally long-range in d = 3, while they are short-range in d = 1, 2. It should be remarked that elasticity is instead 
marginally long-range in d = 2. 

It can be shown in general that, for samples of ellipsoidal shape and when the magnetization M — /i \/V lies 
along the longest principal axis of the ellipsoid, the energy per volume of a system of dipoles on a lattice can be 
written as 



H 1 v 1 

V 2V ^ 3 o 



Ho\M\ 2 D 



(46) 



where Eq is an energy that depends on the crystal structure, and D is the so called demagnetizing factor, which 
is equal to 1/3 for spherically shaped samples, tends to D = for needle shape samples and to D = 1 for disk 
shaped ones. The well-known shape dependence of dip olar energy is hence accounted for by the highly frustrating 
antiferromagnetic demagnetizing term in formula (|46[) [216| |. We will see examples of energies of the form (|4"r?)) in 
subsection IVI A[ while in subsection IVI B 41 we will work out in detail a specific example corresponding to a realistic 
system. 

Us ing methods similar to tho se introduced to prove the existence of a thermodynamic limit for short-range forces 
[278j , it can be shown [3(| Il64| that a system of dipolar spins posses a well defined bulk free energy, independent 



of sample shape, only in the case of zero applied field. The key to the existence of this thermodynamic limit is 
the redu ction in demagnetization energy when uniformly magnetized regions break into ferromagnetically ordered 
domains [207j . Technically, the proof is performed by showing that under the hypothesis of zero field the free energy 
is additive, using invariance under time reversal of the dipole energy, which is a consequence of its bilinearity in 

In the future, magnetic dipolar systems might constitute a field where the results on the statistical mechanics of 
systems with long-range will find a fruitful application. Besides the example discussed in subsection IVI B 41 holmium 
titanate materials [HJ , where dipolar interactions dominate over Heisenberg exchange energy, deserve some attention. 
The possibility to perform experiments with single domain needle shaped dipolar materials has been also stressed 31 
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F. Small systems 



As we have seen, the presence of long-range interactions causes the lack of additivity in macroscopic systems. 
However, even if the interactions are short-range, systems of a linear size comparable to the range of the interaction 
are nonadditivc. Then, we should expect that some of the peculiar features found for macroscopic systems with long- 
range interactions are present also in microscopic or mcsoscopic systems. Examples of this sort are: atomic clusters, 
quantum fluids, large nuclei, dense hadronic matter. 

As it will be shown in the following section, the study of phase transitions is very important for the character- 
ization of the properties of macroscopic systems with long-range interactions, especially in relation to the issue of 
ensemble inequivalence. However, phase transitions do occur also in atomic clusters (liquid-gas and solid-liquid tran- 
sitions), quantum fluids (Bose-Einstein condensation or super-fluid transition), large nuclei (liquid-gas transition), 
dense hadronic matter (formation of quark-gluon plasma) . Extensive studies have been devoted to phase transitions 
in the thermodynamic limit. This limit introduces simplifications in the analytical treatment, also in the case of 
long-range interactions. On the contrary, a consistent theory of phase transitions for small systems has not yet been 
developed. Signatures of phase transitions in finite systems are, however, often found both in numerical and laboratory 
experiments. 

Gross has focused his attention on the theoretical treatment of "Small" systems |l65l 1 1661 . Il67| . His approach 
privileged the use of the microcanonical ensemble. Therefore, he immediately realized the possibility that specific 
heat could be negative and pointed out the feasibility of experiments with heavy nuclei. 

According to Chomaz and collaborators, the key point i s to begin with the general definition of entropy in the 
framework of information theory (see Refs. [HI, [nl, OH [llj, El [HI]). In order to treat on the same ground 
classical and quantum systems, Chomaz introduces the density matrix 

D = J2\^n)Pn(^n\ , (47) 
n 

where \^ n ) are the states of the system and p n their probabilities. The entropy is thus defined by 

S[D] = -TrDhiD. (48) 

Different Gibbs ensembles are obtained by maximizing S[D] with respect to the probabilities p n under some con- 
straints. Each set of constraints defines an ensemble. The different number and functional forms of the constraints 
are related to the different physical situations. Within this framework, the finiteness of small systems is described by 
the introduction of specific constraints. It has been shown that phase transitions in finite systems can be equiva lently 
signalled by three different effects. The first one is the bimodality of the density of states as a function of energy [2131 ] . 
with the distance between maxima corresponding to different phases scaling as the numb er o f particles. The second 
one is a negative slope of the microcanonical caloric curve, i.e. a negative specific heat j232{ . The third one is the 
presence of anomalously large fluctuations in the energy partition between potential energy and kinetic energy. When 
the interactions are short-range, all these features give rise in the thermodynamic limit to the usual phase transitions, 
with the disappearance of the negative slope of the caloric curve. It is also possible to find a further signature of a 
phase transition, that makes a connection with Yang-Lee theory of phase transitions. In this latter theory, all zeroes 
of the partition function lie in the complex plane of the temperature for the canonical ensemble or of the fugacity 
for the grand-canonical ensemble, with an imaginary part different from zero as long as the system is finite. Phase 
transitions in the infinite system are associated to the approach of some of these zeros to the real axis, as system size 
increases. It has been stressed th at th e way in which zeroes approach the real axis may serve as a classification of 
phase transitions in finite systems [113| . 

Some of the previously mentioned signatures of phase t rans i tion s in finite systems have been also experime ntally 
reported. We mention here experiments on atomic clusters jl63l . |285| ] , and experiments on nuclear fragmentation [i~23| . 
In all experiments, the microcanonical caloric curve is compatible with the presence of an energy range where specific 
heat is negative. 

The first set of experiments is realized using atomic sodium clusters NaJi 7 and hydrogen cluster ions H^(H2) m <i4. 
In the first case, the negative specific heat has been found in correspondence to a solid-liquid phase transition, while 
in the second case in the vicinity of a liquid-gas transition. Sodium clusters |285l | are produced in a gas aggregation 
source and then thcrmalizcd with Helium gas of controlled temperature and selected to a single cluster size by a first 
mass spectrometer. Energy of the clusters is then increased by laser irradiation leading finally to evaporation. A 
second mass spectrometer allows the reconstruction of the size distribution, and correspondingly of their energies. 
Performing this experiment at different temperatures of the Helium gas, a caloric curve is constructed. The procedure 
assumes that, after leaving the source, a microcanonical temperature can be assigned to the clusters. Conceptually, 
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this is probably the most delicate point of the experiment. A region of negative specific heat, corresponding to the 
solid-liquid transition, has been reported. 

In the second set of experiments, performed with hydrogen cluster ions [l63l |. the energy and the temperature are 
determined from the size distribution of the fragments after collision of t he cl uster with a Helium projectile. This is 
done using a method introduced in Rcf. [44[ ■ The reported calo ric cu rve [l63| shows a plateau. Work along this line 
is in progress and seems to show a negative specific heat r egion |l5l| . corresponding to a liquid-gas transition. 

In the third set of experiments on nuclear fragmentation [1231 ] , the presence of negative specific heat is inferred from 
the event by event study of energy fluctuations in excited Au nuclei resulting from Au + Au collisions. The data seem 
to indicate a negative specific heat at an excitation energy around 4.5 McV/u. However, the signature corresponds 
to indirect measurements, and the authors cautiously use the word "indication" of negative specific heat. 

IV. EQUILIBRIUM STATISTICAL MECHANICS: ENSEMBLE INEQUIVALENCE 

Our purpose in this section is to propose a fully consistent statistical mechanics treatment of systems with long- 
range interactions. An overview of the different methods (saddle-point techniques, large deviations, etc.) will be 
presented. Simple mean-field models will be used both for illustrative purposes and as concrete examples. Most of the 
features of long-range systems that we will discuss are valid also for non mean-field Hamiltonians as will be discussed 
in Sec. ED 

A. Equivalence and inequivalence of statistical ensembles: physical and mathematical aspects 

1. Ensemble equivalence in short-range systems 

In short-rang e sys tems, statistical ensembles are equivalent. A thorough proof of this result can be found in the 
book by Ruelle [278(. Let us first discuss the physical meaning of ensemble equivalence in order to clarify why for 
long-range interacting systems equivalence does not always hold. 

The three main statistical ensembles arc associated to the following different physical situations: 

• a completely isolated system at a given energy E: microcanonical ensemble; 

• a system that can exchange energy with a large thermal reservoir characterized by the temperature T: canonical 
ensemble; 

• a system that can exchange energy and particles with a reservoir characterized by the temperature T and the 
chemical potential fi: grand canonical ensemble. 

Equivalence of the ensembles relies upon two important physical properties: 

i) in the thermodynamic limit, excluding critical points, the relative fluctuations of the thermodynamic parameters 
that are not held fixed (e.g. energy in the canonical ensemble) vanish; 

ii) a macroscopic physical state that is realizable in one ensemble can be realized also in another (equivalence at 
the level of macrostates). 

Let us concentrate our attention on the second item and let's refer to the equivalence between microcanonical 
and canonical ensemble: an isolated system with a given energy has an average temperature, i.e., an average kinetic 
energy. If instead we put the system in contact with a thermal bath at that temperature, we have an average energy 
equal to the energy of the isolated system. Therefore there is a one-to-one correspondence between energy values and 
temperature values. Actually, in the presence of phase transitions, this statement has to be made more precise, as we 
will comment in subsection IIV A"2] 

The practical consequence of ensemble equivalence is that, for computational purposes, one has the freedom to 
choose the ensemble where calculations are easier, and typically this is not the microcanonical ensemble (it is easier 
to integrate Boltzmann factors than 5-functions). Thus, in spite of its fundamental importance in the construction of 
statistical mechanics, the microcanonical ensemble is practically never used to perform analytical calculations. On the 
contrary i t is v ery much used in numerical simulations, since it constitutes the fundamental ingredient of molecular 
dynamics |X60f | . 

Ensemble equivalence is mathematically based on certain properties of the partition functions . To illustrate this 
point, we again consider the microcanonical and canonical ensembles, referring the reader to Ruelle [278:j for a complete 
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and rigorous discussion. A more precise definition of the microcanonical partition function (see formula {6J) of a system 
in d = 3 with N particles confined in a volume V is given by 

Q(E, V,N) = ±J dq 3N dp 3N 5 (E - H(p, q)) , (49) 

where the domain of integration is the accessible phase space T. 

For lattice systems the definition of is slightly different. There is no explicit volume dependence (because volume 
is fixed once the lattice constant and N are give n) and no N\ term due to the distinguishability of the lattice sites 
(for more details on the Gibbs paradox see [183| ). The microcanonical partition function is in this case 

» N N 

n lattlce (E,N) = / Hd^Udp.S (E - K({ Pl }) - U({ qi })) , (50) 

"* i i 

where and Pi are the conjugate variables attached to site i (see formula ([5])) and K the kinetic energy. 
The entropy is defined by 

S[E,V,N) =]nSl(E,V,N). (51) 

The thermodynamic limit corresponds to N — > oo, E — ► oo and V — > oo such that N/V — > n and E/N — > e, where 
the density n > and the energy per particle e are finite. The limit 

s{s,n)= lim l^.V.JV) (52) 

N^oo JV 

exists and gives the entropy per particle. The function s(e, n) is continuous, increasing in e at fixed n, so that the 
temperature 

T=(ds/dey 1 (53) 

is positive. Measuring the temperature by this formula seems hardly feasible. However, for Hamiltonians wi th k inetic 
energy, it can be shown that T coincides with the average kinetic energy, which is accessible experimentally |280j . For 
short-range systems, s(e, n) is a concave function of e at fixed n, i.e. 

s (cei + (1 - c)s2,n) > cs(ei,n) + (1 - c) s(s2,n) (54) 

for any choice of E\ and 62, with < c < 1 (in lattice systems without kinetic energy the energy can be bounded from 
above, and in turn this implies that T can be negative; however, concavity is still guaranteed if the interactions are 
short-range). This property is important in connection with the partition function of the canonical ensemble, given 
by 

Z(/3, V, N) = i J dq 3N dp m cxp HMT(p, q)\ , (55) 

with (3 > the inverse temperature. A similar definition of a canonical lattice partition function can be given, as 
done for the microcanonical lattice partition function (|50[> . In the thermodynamic limit, the free energy per particle 
is 

f(P, n) = ~ Vto^ ^ In Z(J3, V, N) . (56) 

Moreover, at fixed n, the function cf>({3,n) = (3f(f3,n) is concave in /3. In the following <j>(P,n) will be called the 
rescaled free energy. 

The equivalence between microcanonical and canonical ensemble is a consequence of the concavity of <p and s and 
of the relation between these two functions given by the Legendre-Fenchel Transform (LFT). Indeed, one can prove 
that </>(/3, n) is the LFT of s(e, n) 



</>(0, n) = Pftf, n) = inf [fie - s(e, n)] 



(57) 
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A brief sketch of the proof goes as follows 

exp(-0Nf(p,n)) = Z(fi,V,N) (58) 

(59) 

(60) 

(61) 

where the last equality is valid for large N. The saddle point of the last integral gives formula ([57]) . Let us remark 
that it would not have been necessary to hypothesize the concavity of </> in (3 from the beginning, because it follows 
from the fact that the LFT of a generic function is a concave function. Also the inverse LFT holds, since s(e,n) is 
concave in e 



Z{d,V,N) 

■^JdEj dq 3N dp 3N 5(H(p,q) - E)exp(-f3E), 
J dEQ(E,V,N) exp(-0E), 
dE exp (-N [/3e - s(e, n)]) , 



s(e, n) = inf [(3e — </>(/?, n)] . 



(62) 



This indeed proves ensemble equivalence, because for each value of (3 there is a value of e that satisfies Eq. ([57]) . and, 
conversely, for each value of e there is a value of (3 satisfying Eq. (|62|) . Fig. [6] provides a visual explanation of the 
relation between s and <f> and of the correspondence between s and (3. 



s(e) 



4>(p) 



slope=p 



slope=e it 



LFT 



LFT 




B* 



B 



FIG. 6: Relation between the entropy per particle s(e, n) and cf>(/3, n) = /3/(/3, n) (where f(f3, n) is the free energy per particle) 
by the Legendre-Fenchel Transform (LFT), n is fixed. 



Relation s sim ilar to those described in this subsection explain the equivalence of other ensembles. For instance 
Van Hove 304| proved that the equation of state for a short-range classical system is the same in the canonical and 
grand-canonical ensemble. This implies that isothermal compressibility is positive in both ensembles. 
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2. Phase separation and Maxwell construction in short-range systems 



It is important to discuss ensemble equivalence in the presence of phase transitions. We will s ee th at some interesting 
features arise. Phase transitions are associated to singularities of thermodynamic functions [l83j |. Therefore, in the 
microcanonical and canonical ensemble, they will be signaled by discontinuities in a derivative of some order of the 
entropy s or the rescaled free energy <p. 

Let us concentrate on the dependence of s and 4> on e and (3, respectively. Consider for example a model in which, 
for some choice of parameters, s(e) has a zero curvature in some energy range [ei, £2]- At both extremes of this interval 
the second derivative of s has a discontinuity (see Fig. [7]). Within the range [ei, £2], the function is not strictly concave 
(i.e., Eq. (|54[) is satisfied with an equality). All energy values in this range correspond to the same value of the inverse 
temperature (3 = f3t, the slope of the straight segment of s(e) in Fig. [7l For each energy in this range, the system 
separates in two phases of different energies £1 and £2. Therefore, the energy will be given by 

e — c£± + (1 - c) e 2 (63) 

where c is the fraction of phase 1 and, of course, 1 — c the fraction of phase 2. We emphasize that this relation 
is a direct consequence of additivity. It is easy to show that the Legendre-Fenchel transform of s(e), the rescaled 
free energy (shown in Fig. [7|) has a discontinuity in the first derivative with respect to (3 at (3t ((d4>/d(3) T = £1,2)- 
Following Ehrcnfcst's classification, this is a first order phase transition. 

Let us remark that in this example there is no one-to-one correspondence between e and (3: several microcanonical 
macroscopic states are represented by a single canonical state. This shows that, in the presence of first-order phase 
transitions the relation between the ensembles must be considered with care. We would not say that the ensembles are 
inequivalent in this case, which is a ma rginal one; therefore we do not adopt the term used in mathematical physics 
in this case: "partial equivalence" [300j | . 

Tuning the parameters of the model in such a way that the straight segment in Fig. [7] reduces to one point, one 
recovers a strictly convex entropy function and a one-to-one correspondence between e and (3. Consequently, the 
discontinuity in the first derivative of the rescaled free energy is removed. This is the case of a second order phase 
transition. 




FIG. 7: Entropy s(e) and rescaled free energy 4>{f3) in the case of a first order phase transition. The inverse transition 
temperature is j3t and [ei,£2] is the energy range of phase coexistence. 

It could also happen that the entropy, instead of showing the straight segment of Fig. [7J has a convex region (see 
the full line in Fig. [8^)- For shor t-range interac tions this is what is obse rved for fi nite systems near a phase transition. 
It has been shown numerically [l!3l Ill4 , Il65| | and for simple models |235l . |236| | that by increasing system size the 
entropy approaches the "concave envelope" which is constructed by replacing the full line in the energy range [£i,£2] 
by the straight thick dashed line in Fig. UK- In statistical mechanics, this procedure goes under the name o f Max well's 
construction and is mostly known in connection with the Van der Waals theory of liquid-gas transition [l83| . The 
relation between the construction of the "concave envelope" and the Maxwell construction can be easily established 
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by looking at the following relation 



»(ea) = *(ei)+ / ' & P(e) , (64) 



which derives from the definition of the inverse temperature = ds/de, which is plotted in Fig. [SJd as a function of 
energy. Another way of obtaining s(e2) is by integrating along the thick dashed line in Fig. [8^- 

s(e 2 ) = s(ei) + (e 2 - £i)/3* , (65) 

where /3f = /3(ei) = P(e2)- This implies that 

/ 2 de/3(e) = (ea-ei)A. (66) 
Splitting the integral in two intervals [£1,63) and [£3,62] one gets 

(£1 - e a )0t + f 3 ^ /3(e) = (s 2 - e 3 )p t + / ' dp 0(e) . (67) 



The value £3 is the one obtained from the entropy by looking where, in the convex region, the entropy has slope 0t- 
Condition (|67p is equivalent to the equality of the areas A\ and A 2 in Fig. [5] Introducing the generalized free energy, 
which is a function of both energy and inverse temperature, 

/(/3,£)=£--U(£), (68) 



one obtains from Eq. (p7|) that 

f(0 t ,e 1 )=f(0 t ,e 2 ) . (69) 

This shows that the requirement that the entropy is concave is equivalent to Maxwell's equal areas construction 
and, in turn, equivalent to demand that the generalized free energies, computed at the transition inverse temperature 
0t and at the two energies S\ and £2 which delimit the coexistence region, are equal (and looking at Fig. [8ji, also 
equal to f(0t)-) 

The Maxwell construction is related to the application of a maximum entropy principle for additive systems. Indeed, 
for all energies in the range (£i,£ 2 ), the entropy corresponding to the full line in Fig. [SJa is smaller than the entropy 
corresponding to the dashed line at the same energy. This latter entropy is related to a system which has performed 
phase separation and is therefore obtained as a mixture composed of a certain fraction of a state with energy e± and 
the remaining fraction with energy £ 2 , as in formula (|63p . Having this latter system a larger entropy, the natural 
tendency will be to phase separate. Hence the "concave envelope" recovers maximum entropy states. 

It should be remarked that the truly "locally" convex part of the entropy is the one in the range [e a , £&], while the 
range [ei, £2] is "globally" convex. One should therefore expect a difference in the properties of the physical states in 
the various ranges. Indeed, states in the range [£ a ,£fc] are unstable (dotted line in Fig. [SJd), while states in the ranges 
[£1, £ a ] and [£&, £2] are metastable (dashed lines in Fig.[Sj3): at a solid-liquid phase transition they would correspond to 
superheated solids and supercooled liquids, respectively. While the unstable states cannot be observed, the metastable 
states are observable but are not true equilibrium states, because higher entropy phase separated states are accessible. 

Considering again small systems, the nucleation of a bubble might lead to creation of a convex entropy region. 
Indeed, once a bubble of phase 1 is nucleated in phase 2, the energetic cost of the interface is proportional to the 
surface, while the energetic gain is proportional to the volume. If the system is s mall enough, these two energies 
might be comparable, implying that the additivity property is not satisfied (461 |237( |. 



3. Ensemble inequivalence in long-range systems: negative specific heat 



As already anticipated in the Introduction, an important physical property of systems with long-range interactions 
is that ensembles can be inequivalent. This means that experiments realized in isolated systems, described by the 
microcanonical ensemble, may give different results from similar experiments performed with well thcrmalizcd systems, 
for which the canonical ensemble is the appropriate one. For instance, while the specific heat will turn out to be 
always positive for a system in contact with a heat bath, it might be negative for an isolated long-range system. 




FIG. 8: a) Schematic shape of the entropy s as a function of the energy e (solid line) showing a "globally" convex in region 
in the range [£1,62], the thick dashed line realizes the "concave envelope", b) Inverse temperature (5 as a function of energy 
e. According to the Maxwell's constructions Ai = A2. The curve /3(e) represents states that are stable (solid line), unstable 
(dotted line) and metastable (dashed lines). 



When the interactions are long-range, an entropy function with a convex "intruder" [165| like the one shown by 
the solid line in Fig. [8^1 can represent truly stable equilibrium states. In Sec. IIVB1 we will give a concrete example to 
illustrate this important property Here, we will develop some general considerations which are not specific to a given 
model. 

The construction which has led to the "concave envelope" for short-range systems cannot be realized for long- 
range systems. On the one hand the same notion of phase is ill defined for long-range systems (which are inherently 
inhomogeneous). On the other hand, even if a definition of phase would be possible, the lack of additivity of long-range 
systems would not allow to obtain a mixed state and, in particular, to derive relations like (|63p . 

The starting point for the construction of a consistent thermodynamics of long-range systems is the calculation of 
microcanonical entropy associated to a given macrostate. A microstate is defined by the phase space variables of the 
system, and thus it refers to a precise microscopic state, while a macrostate is described in terms of a few macroscopic 
or coarse-grained variables, and then it generally defines a large set of microscopic states, all of them giving rise to 
the same values of the macroscopic variables. The derivation of free energy from microcanonical entropy using the 
Legendre-Fenchel transform ([57)) is still valid for long-range systems, ensuring that the function <fi is concave also 
for these systems. However, when the entropy has a convex region, the inversion of the Legen dre-F enchel transform, 
Eq. (jnij) does not give the correct microcanonical entropy, but rather its "concave envelope" [30d |. Physically, this 
implies a lack of equivalence of ensembles at the level of macrostates, i.e. all microcano nical macrostates with energies 
between E\ and £2 do not have a corresponding macrostate in the canonical ensemble [14-41 ] . 

The existence of a convex "intruder" in the entropy-energy curve, as in Fig. [5^, is associated to the presence of 
negative specific heat. Indeed, 

— - 1 

where the heat capacity at fixed volume is Cy = dE/dT. Hence, in the energy range [e a ,£&], the convexity of the 
entropy, d 2 S/dE 2 > 0, implies that the heat capacity is negative Cy < 0. This, in turn, implies that the conveniently 
normalized specific heat cy = Cy /N is also negative. 

In the canonical ensemble, the specific heat is always positive, even if the interactions are long-range. This is a 
straightforward consequence of the concavity of the function 0, which is given by Eq. (J57J also for long-range systems. 
Indeed, 

^ = -£K <0 (71) 

d(3 2 T 2^ U ' y' 1 ) 

implying that cy > 0. There is a subtlety related to the calculation of cy at (3 t - For a first order phase transition, 
since there is a discontinuity of the first derivative of <f> at (3t, the specific heat is not well defined and one rather speaks 
of latent heat, related to the jump (£2 — £1) of the energy as shown in Fig. [8]b. At second order phase transitions, the 
second derivative of <j) is instead well defined and is discontinuous at (3 t . 
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As explained above, the presence of a convex "intruder" like in FigJSJi in the entropy of long-range systems does not 
imply the appearance of singularities in the entropy, and therefore it could be doubted that this behavior signals a 
true phase transition in the microcanonical ensemble. Since this feature has been found first for gravitational systems, 
it is sometimes called "gravitational phase transition" . We will better clarify this issue analyzing what happens in the 
canonical ensemble. The rescaled free energy </>(/?) is again expressed by Eq. (|57|) and the mean value of the energy is 

TO 

The plot of £•(/?) is obtained from the curve in Fig. [8b by considering the ordinate (3 as the control variable. The 
Maxwell construction is realized by the horizontal dashed line at /3(ei). If, in the canonical ensemble, we start from 
a value of /3 such that the energy of the system is less than E\ and we gradually decrease (3, the system will reach the 
energy E\ and then will jump to the energy £2, and after the jump /3 will decrease continuously. Therefore, while in 
the microcanonical ensemble there is no singularity of the entropy, in the canonical ensemble there is a discontinuity 
of the derivative of the rescaled free energy <fi, corresponding to a jump in the energy (associated to a latent heat). 
In the canonical ensemble the system has therefore a first order phase transition. Equilibrium macroscopic states 
with energies in the range [£1,22] do not exist, since the lack of additivity, as we noted, does not allow, contrary 
to short-range systems, to have mixtures of states as in Eq. (|63[) . The temperature of the phase transition in the 
canonical ensemble is obtained by the Maxwell construction. Correspondingly, microcanonical microstates exist in 
the energy range [£i,£2] and phase separation is not thermodynamically favoured in this ensemble. 

The fact that the presence of a canonical first order phase transition is necessary to obtain ens embl e inequivalence 
was conjectured in Ref . [13] • This statement has been put on a more r igoro us basis in Refs. [6(| Il44j , analyzing the 
convexity properties of the entropy s(e). In fact, it has been shown |l44j that if the rescaled free energy <t>(/3) is 
differcntiable, then the entropy s(e) can be obtained by its Legendre-Fenchel transform. This applies also for second 
order phase transitions, when the second derivative of 4>(0) is discontinuous. Therefore, in the presence of a second 
order phase transition in the canonical ensemble, the microcanonical and canonical ensembles are equivalent. 

In this subsection we have discussed in detail the case where no singularity are present in the entropy. Although 
already showing all the features of ensemble inequivalence, this case is not generic and we'll discuss in the next 
subsection a model that has a second order phase transition in the microcanonical ensemble and still a first order 
transition in the canonical ensemble. 

Let's conclude this subsection with a remark. We have remarked that energies between E\ and £2 correspond to 
the same value of (3 in the canonical ensemble. It is interesting to figure out what happens if an initially isolated 
system with negative specific heat and with an energy between e± and £2, is put in contact with a heat bath that has 
its inverse temperature (ibath- Looking at Fig. [5] can be of help to understand the argument. We consider the case 
where the energy of the system lies in the range in which the specific heat is negative [e a , £&] when the system is put 
in contact with the bath. Let us take for instance point U in Fig. [5] as an initial point. We are interested to study 
the behavior of the system subjected to small perturbations, so that it can still be considered to be initially close to 
a microcanonical system. We sec immediately that the system becomes unstable. In fact, if it gets a small amount 
of energy from the bath, its temperature lowers (negative specific heat!), and therefore further energy will flow from 
the bath to the system, inducing a lowering of the system's temperature and then creating an instability. If, on the 
contrary, the initial energy fluctuation decreases the system's energy, its temperature rises, inducing a further energy 
flow towards the bath, and, hence, an increase of system's temperature. Thus, in contact with a heat bath, the system 
does not maintain energies in which its microcanonical specific heat is negative. The flow of energy started by the 
initial energy fluctuations stops when the system reaches again the same temperature of the bath, but at an energy 
for which its specific heat is positive. Looking at Fig. [91 it is clear that this could be cither outside the range [ei,^], 
i.e. point S, or inside this range, point M. This feature is valid for all points U inside ]£ a ,£h]. Once in M, the system 
will be in a thermodynamically mctastable state and a sufficiently large fluctuation in the energy exchange with the 
bath will make it leave this metastable state, ending up again in a state with energy outside [£i,£2], i.e. point S, 
which has the same inverse temperature of the bath (3bath- If the system instead jumps directly from U to S, it will 
stay there because this point lies on a thermodynamically stable branch. 



B. An analytical solvable example: the mean-field Blume-Emery-Grifnths model 

We have presented above the main physical and mathematical aspects related to ensemble equivalence or inequiv- 
alence in the study of long-range systems. Other mathematical approaches and tools, that exist, will be presented in 
connection with concrete examples. Actually, this subsection is dedicated to a toy model that exhibits all features 
that have been discussed so far, in particular ensemble inequivalence and negative specific heat in the microcanonical 
ensemble. Historically, the relation between first order phase transition and negative specific heat for long-range 
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FIG. 9: Inverse temperature /3 as a function of energy e. The Maxwell's construction is shown by the dotted line, while the 
dash-dotted line indicates the inverse temperature of the bath Pbath- U denotes an unstable macroscopic state with negative 
specific heat, while M and S are metastable and stable, respectively. 

systems in the thermodynamic limit was first pointed out in Refs. (hi . l299l |. The phenomenology we are going to 
discuss in this section has been heuristically described in Ref. [l2j . 



The Blume-Emery-Griffiths (BEG) model is a lattice spin model with infinite range, mean-field like interactions 
whose phase diagram can be obtained analytically both within the canonical and the microcanonical ensembles. This 
study enables one to compare the two resulting phase diagrams and get a better understanding of the effect of the 
non-additivity on the thermodynamic behavior of the model. 

The model we consider is a simplified version of the Blume-Emcry-Griffiths model [49| . known as the Blume-Capel 
model, where the quadrupole-quadrupole interaction is absent. The model is intended to reproduce the relevant 
features of superfluidity in He 3 -He 4 mixtures. Recently, it has also been proposed as a realistic model for metallic 
ferromagnetism plj . It is a lattice system ([5]), and each lattice point i is occupied by a spin-1 variable, i.e., a variable 
Si assuming the values Si = 0, ±1. We will consider the mean-field version of this model, for which all lattice points 
are coupled with the same strength. The Hamiltonian is given by 



where J > is a ferromagnetic coupling constant and A > controls the energy difference between the ferromagnetic 
Si = 1, Vi, or Si = — 1, Vi, and the paramagnetic, Si = 0,Vi, states. In the following we will set J = 1, without loss 
of generality since we consider only ferromagnetic couplings. The paramagnetic configuration has zero energy, while 
the uniform ferromagnetic configurations have an energy (A — 1/2) AT. In the canonical ensemble, the minimization 
of the free energy F = E — TS at zero temperature is equivalent to the minimization of the energy. One thus finds 
that the paramagnetic state is the more favorable from the thermodynamic point of view if E({±1}) > E({0}), which 
corresponds to A > 1/2. At the point A = 1/2, there is therefore a phase transition; it is a first order phase transition 
since, it corresponds to a sudden jump of magnetization from the ferromagnetic state to the paramagnetic state. 



1. Qualitative remarks 




(73) 
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FIG. 10: Elementary features of the phase diagram of the Blume- Emery-Griffiths model, showing the phase transitions on the 
temperature T and local coupling A axis, respectively 

For vanishingly small ratio A, the first term of Hamiltonian (|75|) can be safely neglected so that one recovers 
the Curie- Weiss Hamiltonian ([I]) with spin 1, usually introduced to solve the Ising model within the mean- field 
approximation. It is well known that such a system has a second order phase transition when T = 2/3 (we remind 
that we are adopting units for which J = l,kg = 1). Since one has phase transitions of different orders on the T 
and A axis (see Fig. I10p . one expects that the (T, A) phase diagram displays a transition line separating the low 
temperature ferromagnetic phase from the high temperature paramagnetic phase. The transition line is indeed found 
to be first order at large A values, while it is second order at small A's. 

2. The solution in the canonical ensemble 

The canonical phase diagram of this model in the (T, A) is known since long time [48|, |49|, [8(j. The partition 
function reads 

Z(fi,N)= £ exp(-/3AE^ + ^fE^M- (74) 

{S 1 ,...,S N } \ i=l \i=l / / 

Using the Gaussian identity 



fb f +OD 

exp(6m 2 ) = W— / dxexp(— bx 2 + 2mbx), 
V 7T J_ OD 

(often called the Hubbard-Stratonovich transformation) with m = J2i S%/N and b — NflJ/2, one obtains 



(75) 



Z(fi,N)= E ^(-^E^jVW / + °° dxe^(-^-x 2 + mNf3 x y (76) 



{Si,...,S N } \ i=1 

One then easily gets 



Z((3, N) = d%£ I dx exp(~N/3f(P, x)) (77) 



2tt 
where 

/(/?, x) = \x 2 -\ ln[l + e-^ A (e^ + e~^)\. (78) 
2 p 

The integral in (|77[) can be computed using the saddle point method where N is the large parameter. The free energy 
is thus 

f(p) =inf f(p,x). (79) 
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It is not difficult to see that the spontaneous magnetization (m) is equal to the value of x at the extremum which 
appears in Eq. (|75|) . We should also note that f((3,x) is even in x; therefore, if there is a value of x different from 
realizing the extremum, also the opposite value realizes it. This means that if the minimum x is equal to the 
system is in the paramagnetic phase, while if x ^ the system is in the ferromagnetic phase, where it can assume a 
positive or a negative magnetization. The phase diagram, in the (T, A) plane, is then divided into a paramagnetic 
region (x = 0) and a ferromagnetic one (x =/= 0). 

Let us now show that the two regions are divided by a second order phase transition line and a first order phase 
transition line, which meet at a tricritical point. As in the Landau theory of phase transitions, we find a second order 
transition line by a power series expansion in x of the function /(/?, x) in Eq. (|78p . The second order line is obtained 
by equating to zero the coefficient of x 2 , i.e., by the relation 

A c ^/3-i e < 3A -l = 0, (80) 

provided that the coefficient of a; 4 is positive, i.e., provided that 

B c = 4-e/ A >0. (81) 

The tricritical point is obtained when A c = B c = 0. This gives A = ln(4)/3 ~ 0.4621 and (3 = 3. The continuation 
of the critical line after the tricritical point is the first order phase transition line, which can be obtained by finding 
numerically the local maximum value x ^ (magnetic phase) for which f((3,x) is equal to /(/3,0) (paramagnetic 
phase), i.e., by equating the free energies of the ferromagnetic and the paramagnetic phases. The behavior of the 
function f(P,x) as (3 varies is shown in Fig. 1111 panel a) represents the case of a second order phase transition 
(A = 0.1) and panel b) the case of a first order phase transition (A = 0.485). 
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FIG. 11: Free energy f(/3, x) vs x for different values of the inverse temperature f3 = 1/T. Left panel shows the case of a second 
order phase transition, temperature values T =0.8 (dashed line), 0.63 (dotted), 0.4 (solid) when A = 0.1 are displayed. Right 
panel shows the case of a first order phase transition with A = 0.485 when T = 0.5 (dashed), 0.24 (dotted), 0.21 (dash-dotted), 
0.18 (solid). 

A picture of the phase diagram is shown in Fig. 1121 



3. The solution in the microcanonical ensemble 

The derivation of the phase diagram of the BEG model (|73|) in the microcanonical ensemble relies on a simple 
counting problem (42j, since all spins interact with equal strength, independently of their mutual distance. A given 
macroscopic configuration is characterized by the numbers N+, N-, Nq of up, down and zero spins, with + iV_ + 
Nq = N. The energy E of this configuration is only a function of N+, iV_ and Nq and is given by 



(82) 
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FIG. 12: Phase diagram of the Blume-Emery-Grimths model in the canonical ensemble. The second order transition line (solid) 
ends at the tricritical point (•), where the transition becomes first order (dotted). 



where Q = J2iLi <Sf = N+ + A_ (the quadrupole moment) and M = J2iLi &i = ~ N- (the magnetization) are 
the two order parameters. The number of microscopic configurations £1 compatible with the macroscopic occupation 
numbers N + , A_ and Nq is 

N\ 

^ = N+IN-W • (83) 
Using Stirling's approximation in the large N limit, the entropy, S = lnfi, is given by 



S = -N 



(1 — q) ln(l — q) + —(q + m) ln(g + rn) + — (q — m) \n(q — m) — gln2 



(84) 



where q = Q/N and m = M/N are the quadrupole moment and the magnetization per site, respectively. Equation 
(|82[) may be written as 

q = 2Ks + Km 2 , (85) 
where K = 1/(2A). Using this relation, the entropy per site s = S/N can be expressed in terms of m and e, as follows 

s(s, m) = -(1 - 2Ke - Km 2 ) ln(l - 2Ke - Km 2 ) - ^(2A'e + Km 2 + m) \n(2Ke + Km 2 + m) 

-i(2A"e + Km 2 - m) \n{2Ke + Km 2 - m) + (2Ke + Km 2 ) In 2 . (86) 

At fixed e, the value of m which maximizes the entropy corresponds to the equilibrium magnetization. The corre- 
sponding equilibrium entropy 

s(e) = sup s(e, m) (87) 

m 

contains all the relevant information about the thermodynamics of the system in the microcanonical ensemble. As 
usual in systems where the energy per particle is bounded from above, the model has both a positive and a negative 
temperature region: entropy is a one humped function of the energy. In order to locate the continuous transition line, 
one develops s(s,m) in powers of m, in analogy with what has been done above for the canonical free energy 

s = S + A mc m 2 + B mc m 4 + 0{m 6 ) , (88) 

where 

s = s(e, m = 0) = -(1 - 2Ke) ln(l - 2Ke) - 2Ke ln(Ae) , (89) 

and 

A ™ ^ -^W^-lKl' (90) 
All 

(91) 
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In the paramagnetic phase both A mc and B rnc are negative, and the entropy is maximized by m = 0. The continuous 
transition to the ferromagnetic phase takes place at A mc = for B rnc < 0. In order to obtain the critical line in the 
(T, A) plane, we first observe that temperature is calculable on the critical line (m = 0) using |53|) and (|89|) . One gets 



1 
T 



2K In 



1 - 2Kb 
Ke 



Requiring now that A mc = 0, one gets the following expression for the critical line 

exp\ft/(2K)] 







1 



(92) 



(93) 



Equivalently, this expression may be written as /3 = l/(2Ke). The microcanonical critical line thus coincides with the 
critical line (|80[) obtained for the canonical ensemble. The tricritical point of the microcanonical ensemble is obtained 
at A mc = B mc = 0. Combining these equations with Eq. (|92|) . one finds that, at the tricritical point, (3 satisfies the 
equation 



2/3 2 



1 + 2 exp - 



2K 



K 



1 

12 



= 



(94) 



Equations (|93|) and (|94l) yield a tricritical point at K ~ 1.0813, j3 = 3.0272. This has to be compared with the 
canonical tricritical point located at K = 1/(2A) = 3/ ln(16) ~ 1.0820, /3 = 3. The two points, although very close to 
each other, do not coincide. The microcanonical critical line extends beyond the canonical one. This feature, which is 
a clear indication of ensemble inequivalence, was first found analytically for the BEG model [42j and later confirmed 
for gravitational models I94l. 97[ . The non coincidence of microcanonical and canonical tricritical points is a generic 
feature, as proven in Ref. [60 1. 



4- Inequivalence of ensembles 

We have already discussed in general terms the question of ensemble equivalence or inequivalence in Sees. IIV A II 
and IIV A 31 Inequivalence is associated to the existence of a convex region of the entropy as a function of energy. 
This is exactly what happens for the BEG model in the region of parameters 1 < K < 31n(16). Since the interesting 
region is extremely narrow for this model [42| , it is more convenient to plot a schematic representation of the entropy 
and of the free energy (see Fig. I13p . We show what happens in a region of K where both a negative specific heat and 
a temperature jump are present. The entropy curve consists of two branches: the high energy branch is obtained for 
m = (dotted line), while the low energy one is for m ^ (full line). The m — branch has been extended also 
in a region where it corresponds to mctastable states, just to emphasize that these correspond to a smaller entropy 
and that it remains a concave function overall the energy range. We have not extended the m ^ branch in the 
high energy region not to make the plot confusing: it would also correspond to a metastable state. The two branches 
merge at an energy value Et where the left and right derivatives do not coincide; hence microcanonical temperature 
is different on the two sides, leading to a temperature jump. It has been proven in Ref. [60j ] . that for all types of 
bifurcation the temperature jump is always negative. In the low energy branch, there is a region where entropy is 
locally convex (thick line in Fig. 1 13[) . giving a negative specific heat according to formula (|70[) . The convex envelope, 
with constant slope /3 t is also indicated by the dash-dotted line. In the same figure, we plot the rescaled free energy 
4>(P), which is a concave function, with a point (3t where left and right derivatives (given by s\ and £2 respectively) 
are different. This is the first order phase transition point in the canonical ensemble. 

A schematic phase diagram near the canonical tricritical point (CTP) and the microcanonical one (MTP) is given in 
Fig. 1141 In the region between the two tricritical points, the canonical ensemble yields a first order phase transition at 
a higher temperature, while in the microcanonical ensemble the transition is still continuous. It is in this region that 
negative specific heat appears. Beyond the microcanonical tricritical point, temperature has a jump at the transition 
energy in the microcanonical ensemble. The two lines emerging on the right side from the MTP correspond to the 
two limiting temperatures which are reached when approaching the transition energy from below and from above (see 
Fig. 115b and I15t i). The two microcanonical temperature lines and the canonical first order phase transition line all 
merge on the T = line at A = 1/2. 

To get a better understanding of the microcanonical phase diagram and a lso in order t o com pare our results 
with those obtained for self-gravitating systems [H, [9?J and for finite systems [l!3l . Ill4 Il65l . Il66| , we consider the 
temperature-energy relation T(e) (also called in the literature "caloric curve"). Also this curve has two branches: a 
high energy branch (|92[) corresponding to m = 0, and a low energy branch obtained from (|53p using the spontaneous 
magnetization m s (e) ^ 0. At the intersection point of the two branches, the two entropies become equal. However, 
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FIG. 13: Left graph: schematic plot of the entropy s(e) as a function of energy density e for the BEG model in a case where 
negative specific heat coexists with a temperature jump. The dash-dotted line is the concave envelope of s(e) and the region 
with negative specific heat cv < is explicitly indicated by the thick line. Right graph: Rescaled free energy <j>((S): the first 
order phase transition point fit is shown. 




FIG. 14: A schematic representation of the phase diagram, where we expand the region around the canonical (CTP) and the 
microcanonical (MTP) tricritical points. The second order line, common to both ensembles, is dotted, the first order canonical 
transition line is solid and the microcanonical transition lines are dashed (with the bold dashed line representing a continuous 
transition). 

their first derivatives at the crossing point can be different, resulting in a jump in the temperature, i.e. a microcanonical 
first order transition. When the transition is continuous in the microcanonical ensemble, i.e. the first derivative of 
the entropy branches at the crossing point are equal, BEG model always displays, at variance with what happens 
for gravitational systems, a discontinuity in the second derivative of the entropy. This is due to the fact that here 
we have a true symmetry breaking transition 60]. Fig. 1151 displays the T(e) curves for decreasing values of K. For 
K = 3/ln(16), corresponding to the canonical tricritical point, the lower branch of the curve has a zero slope at the 
intersection point (Fig. [TSr). Thus, the specific heat of the ordered phase diverges at this point. This effect signals 
the canonical tricritical point through a property of the microcanonical ensemble. Decreasing AT, down to the region 
between the two tricritical points, a negative specific heat in the microcanonical ensemble first arises (dT/de < 0), see 
Fig. 115b . At the microcanonical tricritical point, the derivative dT/de of the lower branch diverges at the transition 
point, yielding a vanishing specific heat. For smaller values of AT, a jump in the temperature appears at the transition 
energy (Fig. 115b h The lower temperature corresponds to the m = solution (|92[) and the upper one is given by 
exp(/3/2A') = 2(1 — q*)/\/(q*) 2 — {m*) 2 , where m* 7 q* are the values of the order parameters of the ferromagnetic 
state at the transition energy. The negative specific heat branch disappears at even smaller values of K, leaving just 
a temperature jump (see Fig. I15H ). In the K — > 1 limit the low temperature branch, corresponding to q — m = 1 in 
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the limit, shrinks to zero and the m = branch ([92]) occupies the full energy range. 




FIG. 15: The temperature-energy relation in the microcanonical ensemble for different values of K. The dashed horizontal 
line is the Maxwell construction in the canonical ensemble and identifies the canonical first order transition temperature at the 
point where two minima of the free energy coexist. Thick lines identify negative specific heat in the microcanonical ensemble. 
We don't report the numerical values on the axes for the readability of the figure. 



C. Entropy and free energy dependence on the order parameter 

In this section, we will discuss in detail the dependence of both the canonical free energy and the microcanonical 
entropy on the order parameter. This will allow to understand more deeply the relation between the two ensem- 
bles by revisiting Maxwell constructions. Besides that, we will also discover an interesting physical effect, negative 
susceptibility, of which we will give an explicit example, see Sec. IIV E 41 



1. Basic definitions 

Let us start from Eq. (|86|) , that gives the entropy per site s(e,m), for the BEG model, as a function of the energy 
per site e and the magnetization m. This entropy is proportional to the logarithm of the number of configurations 
which have a given energy and a given magnetization. In the general expression Eq. (|50p . these configurations can 
be obtained by adding a further Dirac delta function in the integrand, so that only the configurations with a given m 
would be counted. Thus one gets 

1 r N 

s(e,m)= lim — In / TTdq* 6 {E - U{{m})) 5 (Nm - M({qs})) (95) 

J i 

where M is the total magnetization corresponding to configuration {q;}. For spin models, the local variable takes 
discrete values = Si, hence M = J^i &i an d the integral in Eq. ([55)) is replaced by a discrete sum. Besides that, 
there is no kinetic energy, hence only the potential energy appears H = U . The calculation of entropy (|95[) is often 
an intermediate step in the calculation of s(e), with M the order parameter. In order to get s(e), one computes the 
global maximum of the constrained entropy (|95[) . In the thermodynamic limit, this procedure is fully justified, since 
the relative contribution of all configurations corresponding to values of the order parameter that are different from 
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the one realizing the global maximum, vanishes. This is what has been done in our study of the BEG model, e.g. in 
Eq. (J571). 

In the canonical ensemble, the computation of the partition function for a given value of the order parameter, i.e. 
for the system at a given temperature and at a given magnetization m, can be obtained by adding a Dirac delta 
function to the integrand in Eq. (|55l) 



fW,m) 



-I lim i In V dSiexpl-pHttSiWSiNm-MttSi})) . 

P N— >oo I\ — ' 

{S lt ...,S N } 



(96) 



Thus, the free energy depends on both (3 and the magnetization m. Finally, a relation analogous to Eq. (|57|) holds 
between the entropy and free energy 



f(P,m) = inf 



(97) 



valid, as before, for all systems, independently of the range of the interactions. 
We are therefore led to the introduction of the generalized free energy (see Eq. 



/(/3,e,m) 



s(e,m) , 



(98) 



that will be used, as previously, to study the relation between microcanonical and canonical equilibrium states. 
Needless to say, it is not at all guaranteed that the function s(s, m) can be easily derived, in general, as we have done 
for the BEG model; nevertheless this general discussion is useful to show how the properties of this function explain 
the occurrence, or not, of ensemble equivalence. 

In the microcanonical ensemble, the entropy s(e) of the system at a given energy e is given by formula (|87[) . In the 
canonical ensemble, the free energy /(/3) of the system at a given inverse temperature f3 will be given by 



f(0) = w£f(J3,e,m) 



inf /(/?, to) = inf 

m s.m 



s(e, to) 



(99) 



as can be easily deduced by Eqs. (|87p. ([FT)) . (|97|) and (155)) . The two extremal problems (|87[) and ([M]) . that basically 
contain the single function s(e, m), can be employed to study ensemble equivalence. Suppose we fix (3 and solve 
the extremal problem (|99p . finding the values of e and to that realize an extremum. Then, we will have ensemble 
equivalence if the same values will realize the extremum in formula (|87[) while at the same time the derivative ds / de 
will be equal to the fixed value of /3. In conclusion, we seek in both extremal problems the solution of the following 
first order conditions 



ds_ 

dm 
ds 

~d~£ 



o 



0. 



(100) 
(101) 



We denote by £*(/?), m*(f3) the solution of the variational problem (|100p and (|101[) . Using (jlOip . it is straightforward 
to verify that 



dp 



'(PI 



(102) 



meaning that the value of e at the extremum is indeed the canonical mean energy. 

However, we have to consider also the stability of these extrema. We denote derivatives by subscripts, e.g., s m is 
the first derivative of s with respect to m. The only condition required by (|87[) is that s mm < 0. In order to discuss 
the stability of the canonical solution, one has to determine the sign of the eigenvalues of the Hessian of the function 
to be minimized in (|99p . The Hessian is 



n 



The extremum is a minimum if and only if both the determinant and the trace of the Hessian are positive 



-4s > 



(103) 



(104) 
(105) 
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This implies that s £e and s mm must be negative, and moreover s e£ < — s^ ne /\s mm \. This has strong implications on 
the canonical specific heat, which must be positive, as it has been shown on general grounds in subsection IIV A"3l 
see Eq. (fT9|) . Let us prove it by using the variational approach, instead of the usual Thirring argument [1771 1297| . 
that uses the expression of the canonical partition sum. Indeed, taking the derivatives of Eqs. (|100[) and (| 1 1[) with 
respect to /3, after having substituted into them e* (/3) , to* (/3) , one gets 

_ de* . dm* . . 

s -^ +Sem w = 1 (106) 

de* dm* 

s -dp +Smm W = °' (107) 

where all second derivatives are computed at £*(/?), m*(/3). Recalling now that the specific heat per particle at 
constant volume is 

■ < 108 » 

one gets 

c v =p \ 2 * m _ m . , (109) 

which is always positive if the stability conditions (|104|) and (|105p are satisfied. Since the stability condition in the 
microcanonical ensemble only requires that s mm < 0, a canonically stable solution is also microcanonically stable. 
The converse is not true: one may well have an entropy maximum, s mm < 0, which is a free energy saddle point, with 
s se > 0. This implies that the specific heat (|109|) can be negative. 

The above results are actually quite general, provided the canonical and microcanonical solutions are expressed 
through variational problems of the type ([57]) and (f9"9")l . The extrema, and thus the caloric curves T(e), are the sa me in 
the two ensembles, but the stability of the different branches is different. This aspect was first discussed by Katz |20l| 
in connection with self-gravitating systems (see also Ref. [94|). 



2. Maxwell construction in the microcanonical ensemble 



We have already discussed Maxwell's construction in subsection IIV A 21 We have shown that, for short range 
systems, where microcanonical and canonical ensembles are always equivalent, Maxwell's construction derives from 
the concave envelope construction for the microcanonical entropy. This in turn is a consequence of additivity and 
of the presence of a first order phase transition in the canonical ensemble. Here, we discuss Maxwell's construction 
for long-range systems, where the microcanonical entropy can have a stable convex intruder, leading to ensemble 
inequivalence. 

The study of the BEG model, in subsection IIV Bl has emphasized the presence of an extremely rich phenomenology 
In particular, in a specific region of the control parameter K, the canonical and microcanonical ensembles show a first 
order phase transition, with a forbidden energy range in the former ensemble and a temperature jump in the latter 
(see Fig. [T5b). Both the e(/3) curve and the /3(e) one become multiply valued if we include metastable and unstable 
states. Since we know that the Maxwell construction leads to an equal area condition for the /3(e) curve, which defines 
the phase transition inverse temperature /3t in the canonical ensemble, we wonder here whether a similar construction 
exists for the e(/3) relation which would lead to the determination of the transition energy e t in the microcanonical 
ensemble. 

In the following discussion of Maxwell's construction, it is crucial to understand the mechanism that generates 
multiple branches of the /3(e) curve. Let us define 

|9(g,m)= y d£ ' . (110) 

We have explained that the equilibrium magnetization to*, at any energy e, is the global maximum of the entropy per 
site Once to* has been computed, the equilibrium inverse temperature is given by /3(e) = /3(e, m*). However, 

also local maxima, local minima and saddles of the entropy exist, corresponding to different values of to. Following 
such critical points as a function of e, one determines the different branches of /3(e). In particular, we have the 
continuation at energies lower than the microcanonical transition energy et of the high energy to = branch (dashed 
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part of /3#(e) in Fig. [T6|) and the continuation at higher energies of the magnetized branch /3t(e). It's interesting to 
remark that the m = point remains an extremum for all values of e since s(e, m) is even in m. 

An example of inverse temperature /3(e) relation is plotted in Fig. [16] The lower branch (3l starts at low energy and 
ends at the energy eh, where its derivative becomes infinite. The upper branch (3h starts at high energy and ends at 
the energy el, where again its derivative is infinite. These two branches are connected by the vertical line at energy 
£f, and by the intermediate branch /3i that goes from el to eh- The equilibrium state is given by the lower branch 
for e < Et and by the upper branch for e > e t . The lower branch for e t < e < er (dashed) and the upper branch for 
El < £ < £ c (dashed) represent metastable states, while the intermediate branch represents unstable states (dotted). 
Therefore, increasing the energy, the equilibrium value of /3 jumps from the lower to the upper branch (thus following 
the vertical line) at the transition energy e t . It is easy to show that the vertical line realizes a Maxwell construction, 
i.e., that the two areas A\ and A 2 are equal. The curve (3(e) has therefore three branches, that we denote by /3^(e) 
(the low energy magnetized branch), /3/(e) (the intermediate branch of unstable states) and /3#(e) (the high energy 
paramagnetic branch). Then, we have 

A 2 - A 1 = I" /3 L (e)de + f " /3 7 (e)de + f ' f3 H (e)de (111) 

Jet J eh J el 

= {sl{£h) - s L (e t )) + {si{el) - si(e H )) + (sff(et) - s h {sl)) , (112) 

where in the r.h.s. Sj(e) is the function whose derivative gives the branch (3i(e), with i = H,I,L. We use now the 
continuity property of the entropy, imposing that sl(eh) = si(sh) and si(el) = sh{el)- Moreover, the transition 
occurs at the energy where the entropies of the low energy branch and of the high energy branch are equal, i.e., that 
s i(et) = sjj(£t)- We then obtain that A\ = A 2 , It should remarked that the values of the three branches of /3(e) at 
E t determine the size of the temperature jump. Indeed, /3i(e t ) = (3 L , /3/(e t ) = j3j and /3_y(e t ) = f3* H (see Fig fTB]) . 
The equal area condition implies that 

/ d/3 [e(/3) - et] = . (113) 

J&L 

Using e = d</>/d/3, one gets 

4>(fi H ) - 4>{f3 L ) - e t (f3 H - p L ) = , (114) 
which, after defining the generalized entropy 

s(/3,e)=/3e-0(/3) , (115) 

leads to 

s(PL,e t ) = s(P H ,e t ), (116) 

which is the condition equivalent to (f6T)|) . 

The first time Maxwell's construction appears for self-gravitating systems is in Ref. [20j . It was later extended to 
microcanonical phase tran sitions in Rcfs. [43l . |93| . On the other hand canonical and microcanonical caloric curves 
were studied in Ref. [29C| . It is also possible that the curve is made of several disconnected branches. We then 
emphasize that for more complex caloric curves than the one in Fig. 1161 the evaluation of Maxwell's areas must be 
done cautiously [13, [9?J • 



3. Negative susceptibility 

We have seen that ensemble inequivalence can give rise to negative specific heat in the microcanonical ensemble. We 
show here that another consequence of ensemble inequivalence is the existence of equilibrium microcanonical states 
with a negative magnetic susceptibility, a non negative quantity in the canonical ensemble. We will follow a treatment 
close to that of subsection IIV C ll 

Fixing the energy and the magnetization, the entropy is given by Eq. (|95p . From the first principle of thermo- 
dynamics, that for magnetic systems reads TdS = dE — hdM, with E and M the internal energy and the total 
magnetization of the system, respectively, it is straightforward to prove the following formula for the average effective 
magnetic field in the microcanonical ensemble (see also [79|) 
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FIG. 16: Typical shape of the /3(e) curve at a microcanonical first order phase transition. The transition energy e t is determined 
by an equal area A\ = A2 Maxwell's construction. All states are represented: stable (solid line), metastable (dashed lines) and 
unstable (dotted line). The inverse temperature jump is given by — /32- 



ds_ 

Me to) = — ^ = - — 

de 



(117) 



Taking into account that, like (3 is canonically conjugated to H , (3h is canonically conjugated to to, it is natural to 
define the following partition function 



Z((3,h,N)= exp{-0[Hm})-hM({Si})]}, 

{S U ...,S N } 



(118) 



where, as in Eq. ([55]) . M({Si}) is the total magnetization corresponding to configuration {Si}. Analogously to 
Eq. ([M]) . the free energy is 



f(/3,h) = -~ lim ±-]*Z(fl,h,N) = w£ 

p N^oo 1\ s, m 



e — hm — -xs(e, m) 

P 



(119) 



For h = 0, we obviously recover Eq. ([M]). As in subsection [IV C 1[ we sec that the relation between the two ensembles 
can be studied by analyzing the single function s(e,m). Taking into account Eq. (|117|) . the variational problem in 
Eq. (|119p , together with the variational problem that defines s(e) can be solved by imposing that 



ds 
dm 
ds 

d~e 



-hp 
-0, 



(120) 
(121) 



that generalize the conditions ()100p and (|10ip to h ^ 0, providing the functions e(j3 : h) and to(/?, h). Independently 
of the value of h, the stability conditions in the canonical ensemble are the same as those given in Eqs. (|104p and 
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(II -5 1) . In the microcanonical ensemble, since we are not maximizing with respect to to, we have no condition on the 
second derivative of s(e, to) with respect to to. 



Magnetic susceptibility is defined as 



9m , „„. 

X = ^r- (122) 



dh 



Deriving ()120[) and (|121| with respect to h, one obtains 



dh - ,n 8h 

from which we get 



de dm . 

Sme~Q^ s mm Pi (123) 

_ 9e <9to 

s -^ + s -UT = °- ( 124 ) 



X = -0 - . ^ 2 • (125) 

This formula is valid in both the canonical and microcanonical ensemble. However, the results can differ in the two 
ensembles because the quantities in this formula arc computed at different stationary points in the two ensembles. In 
the canonical ensemble, x 1S positive definite for all h because of the stability conditions (|104|) and (|105|) . 

As for the specific heat, the positivity of magnetic susceptibility in the canonical ensemble can be derived on general 
grounds from Eq. (|118[) . since it is easily shown that susceptibility is proportional to the canonical expectation value 
<(M-<M» 2 >. 

On the other hand, in the microcanonical ensemble, as already remarked, no condition on the second derivatives of 
s with respect to to is required, and therefore susceptibility can have either sign. Indeed, in Ref. [79^ it is shown that 
a simple 4 model can display a negative microcanonical susceptibility (see also subsection IIV E 51) . 

With ensemble equivalence, as it always happens in short-range systems, magnetic susceptibility is positive also 
in the microcanonical ensemble. Wc note that this result is also a byproduct of the convexity property discussed in 
subsection III C| since the attainable region in the (e,m) plane is necessarily convex for short-range systems. In fact, 
this implies that Eqs. (|104j) and (|105|) are satisfied for all equilibrium values (e,m). 



D. The Hamiltonian Mean Field model 



Very few examples are known in statistical mechanics where one can explicitly compute microcanonical entropy 
in ca ses where the variables are continuous. Everybody knows the perfect gas derivation of microcanonical entropy 
jl83l |. However, as soon as one considers interactions, the task becomes unfeasible. On the other hand, since for 
short-range systems microcanonical and canonical ensemble are equivalent and it is much simpler to perform integrals 
with Boltzmann weights rather than with Dirac delta functions, much more attention has been devoted to compute 
free energies. An exception is the study of gravitational systems for which, since it was clarified that ensembles can 
be non equiva lent [l77l |297| | , some attention has been devoted to calculations in different ensembles [94|, l97l . 
|259| . |260| . [282] . In particular mean- field models have been shown to be solvable using saddle point methods 
and phase transitions in the grand-canonical, canonical and microcanonical ensembles have bee n studied 
another line of research, simplified gravit ational on e-dimensional models have been considered [M Gil, of which 
microcanonical solutions have been found |244l . l28ll | . 

This section is devoted to the discussion of a mean-field model, the Hamiltonian Mean Field (HMF) model, whose 
potential keeps only the first mode of the Fourier expansion of the potential of one-dimensional gravitational and 
charged sheet models. Indeed, it turns out that the model is nothing but the mean-field version of the XY model (see 
e.g. Chapter 6.1 of Ref. [HI), which, however, had never been studied in the microcanonical ensemble. 

The HMF model has been extensively studied for more than a decade. The simple mean-field interaction allows us 
to perform analytical calculations, but maintains several complex features of long-range interactions. 

It has been shown that the behavior of certain wave-particle Hamiltonians can be understood using the HMF model 
as a reference. For instance, some equilibrium and non equilibrium properties of the HMF model can be mapped onto 
those of the Colson-Bonifacio model of a single-pass Free Electron Laser [37], [HI . 

In this section, we will derive both the canonical and the microcanonical solutions, that we anticipate to be 
equivalent. The microcanonical solution will be obtained by three different methods: a straightforward one, inspired 
by the solution of gravitational models ( subsection IIV D 3)) . and two more involved ones, using a variational procedure 
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( subsection IIV D 4]) or large deviations (subsection IIV E 3)) . This latter, although more complicated, will allow us to 
introduce a method of solution which has a much wider range of applications. 

Moreover, this model will serve as a paradigm for discussing important dynamical behaviors typical of long-range 
interacting systems, which will be presented in Sec. IV Al 



1 . Introduction 

The Hamiltonian Mean Field model [ll|, ll 10L Il25| | is defined by the following Hamiltonian 

N 2 T 

^ = Ey + ^E[ 1 - C0S ^-^)] . ( 126 ) 

i=l i,j 

where 0i € [0, 2n[ is the position (angle) of the i-th unit mass particle on a circle and pi the corresponding conjugated 
momentum. This system can be seen as representing particles moving on a unit circle interacting via an infinite range 
attractive (J > 0) or repulsive (J < 0) cosine potential or, alternatively, as classical XY-rotors with infinite range 
ferromagnetic (J > 0) or antiferromagnetic (J < 0) couplings. The renormalization factor N of the potential energy 
is kept not only for historical reasons, but also because, as we have explained in section UTAl in this way the energy 
per particle and temperature are well defined in the N — > oo limit. In the literature, some authors have treated the 
case in which the ener gy is not extensive, i.e. they remove the factor 1/N. This leads to different thermodynamic 
limit behaviors 0, l298| | . 

H istor ically, this model has been independently introduced in the conti nuou s time version in Refs. p a [l3 l [M 

187l . l279j | , and had been previously considered in its time discrete version |212| . 

We will solve the HMF model in both the canonical and microcanonical ensembles. Alter nativ ely, the model can 
be solved using the maximum entropy principle for the single particle distribution function [186], which is however 
suitable only for mean field models. 

It is very useful to rewrite the Hamiltonian (|126p in a different form, using the definition of the x and y components 
of the microscopic magnetization 

1 ^ 1 N 

cosOi and m u = — ^ sin 6*; . (127) 



i=l i=l 



We then easily find that 



N r? N J 

F N =^|4(l- m 2 ) . (128) 



In the following we will treat only the ferromagnetic case and we will set J = 1 without loss of generality. The 
antiferromagnetic case is less interesting for what equilibrium properties are concerned (the homogenous state is stable 
at all energies), but it dis plays inte resting dynamical features, like the formation of collective modes under the form 
of "biclusters" M, M, HI IM fTH . 



2. The canonical solution 



The canonical solution of this model can be easily derived. After the trivial Gaussian integration over the momenta, 
the canonical partition function reads 



z( ,, w) = cxp| _f)(^ w 



^ 2N 



' N 

E 



A' 



cos( 



sin 0i 



(129) 
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Using the Hubbard-Stratonovich transformation, sec ([75]), this expression becomes 



Z(0,N) 



exp 



■f ) (|) 



N/2 



N/3 . 

I daqd.T 2 exp <j TV 



8(x 2 +x 2 2 ) 



+ ]nIo(8(x 2 +x 2 )*) 



where Iq(z) is the modified Besscl function of order 



p2n p2ir 

Iq{z) = / d6 exp (z\ cos 6 + Z2 sin 6) = / d(9exp 
Jo Jo 



zcos( 



1 /2 

where z = (z\ + z|) . We can go to polar coordinates in the {x\,X2) plane, to obtain 



-?)(f 



N/2 



TV/3 / da; exp i TV 



Bx 2 

-^L+fnloCto) 



(130) 



(131) 



(132) 



In the thermodynamic limit TV — » oo, the integral in (|132p can be computed using the saddle point method, so that 
the rescaled free energy per particle is 



m = pus) 



- In 2tt + - In 8 + inf 

2 2 2 x>o 



lnJ (/3a;) 



(133) 



The extremal problem in x in the last equation can be easily solved graphically, by looking for the solution of the 
equation 



h{Bx) 
Iq(Bx) ' 



(134) 



where I\(z) is the modified Besscl function of order 1, which is also the derivative of Io(z). The graphical solution is 
made easier by the fact that, for real z > 0, the function Ii/Iq is positive, monotonically increasing and with negative 
second derivative (see, e.g., Ref. [75|, where a generalization of the HMF model is studied, and where these properties 
are proved for an entire class of functions that include Ii/Iq)- Fig. [T7l shows the graphical solution of (|134[) for two 
different values of 3. For 3 < 2 the solution is given by x = m* = 0, while for 3 > 2 the solution monotonically 
increases with 3, approaching to* = 1 for 8 — > oo. The solution m* = of (| 134[) . present for all values of 8 : is not 
acceptable for 8 > 2, since it's not a minimum in formula (|133[) . 
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FIG. 17: Graphical solution of Eq. (| 134[) for two values of 0, one above and one below the critical value j3 c = 2. In both plots 
the straight dashed line is the bisectrix, while the curved solid line is the ratio of modified Bessel functions on the r.h.s. of 
Eq. (|134|l . a) = 4: the equilibrium solution is m* 7^ 0; b) = 1.5: the only and equilibrium solution is m* = 0. 



It is not difficult to show that the value m* realizing the extremum in Eq. (]133[) is equal to the spontaneous 
magnetization. We emphasize that values of x different from this are not related to the magnetization of a state 
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different from equilibrium state, but that only the minimizcr m* , which is always between and 1. is the magnetization 
of the equilibrium state. This should be suggested also by the fact that the integration range in Eq. (|132[) goes from 
to oo. 

From the above procedure it is clear that the spontaneous magnetization is defined only in its modulus to*, while 
there is a continuous degeneracy in its direction. 

In conclusion, we have shown that the HMF model displays a second order phase transition at (3 C = 2 (T c = 0.5). 
The derivative of the rescaled free energy with respect to f3 gives the energy per particle 

e{P) = Yp + \-\( m *^)) 2 ( 135 ) 

As already evident from the Hamiltonian, the lower bound of e is 0. At the critical temperature the energy is e c = |. 
A plot of the function <p{(3) is shown in the next subsection, after the microcanonical solution. 



3. The microcanonical solution 



Since we have shown that the HMF model has a second order phase transition in the canonical ensemble, we 
could immediately conclude, according to the remark in subsection IIV A 31 that ensembles are equivalent for this 
model. Deriving the entropy would be straighforward using the inverse Legendre-Fenchel transform of the free energy 
computed in the previous subsection (see formula (|133p ). However, since in the following we will need the explicit 
expression of the microcanonical entropy of the HMF model both as a function of the energy and of magnetization in 
order to solve a generalized HMF model in the microcanonical ensemble, we will devote this subsection, the following 
and subsection IIV E 31 to the derivation of the entropy for the HMF model using several methods. The reason for 
such a thorough derivation is mainly pedagogical: we want to show the application of three different methods to a 
simple model. The first method is mutuated from similar ones used in the context of gravitational systems. The 
second one, more general, is a variational method based on a strong hypothesis on the form the partition function, 
and allows a straightforward derivation of the entropy, also in cases of ensemble inequivalence. The third method 
illustrates the application of large deviation theory, which is the most general currently available tool to solve systems 
with long-range interactions. 

The microcanonical solution has been heuristically obtained, under the hypothesis of concave entropy, in Ref. fioj 
and in a different form in Ref. [307| . 

The simplicity of the Hamiltonian makes it possible to obtain directly the thermodynamic limit of the entropy per 
particle, as we now show. First of all, let us introduce the method usually applied in self-gravitating systems [47| . 
Denoting by K and U the kinetic and the potential energy, respectively, the number of microscopic configurations 
corresponding to the energy E in a generic system is given by 

fl(E,N) = fY[dpid0i5(E-H N ) (136) 

i 

= J JJdpiddi J dKS^K-^^Pj HE-K-U({d t })) (137) 



(dK J n^M^-Ey) jXlMiHE-K-um})) . 

v ' V v ' 

fi k i„(i<") n cant {E-K) 



(138) 



The factor fikin, which is related to the surface of the hypersphere with radius R = \JlK in N dimensions, 
can be computed straightforwardly using the properties of the Dirac S function, obtaining the expression: fikin = 
2tt n / 2 R n ~ 2 /T(N/2). Using the asymptotic expression of the r-function, ]n.T(N) ~ (JV— 1/2) In N-N+ (1/2) ln(2vr), 
and keeping only the terms that do not give a vanishing contribution to the entropy per particle in the thermodynamic 
limit, one obtains 



il kin (K) - exp I — 

N 



N 

l + hi7r-ln— +ln(2A") 



(139) 



exp ( — [1 + ln(27r) + \nu] I , (140) 
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where u = 2K/N. Defining the configurational entropy per particle s con f(u) = (In fl con i(Nu)) /N , where u = U/N 
(E - K)/N = e - u/2, Eq. (fT3"g|) can be rewritten as 



Q(Ne,N) iV ~ +00 ^ / 



1 ln(27r) 1 



— In u + s r 



2 2 

Hence, solving the integral in the saddle point approximation, we obtain the entropy 



(u) 



s(e) = lim — ]nSl N (eN) 

= o + Ti M» + SUp 



lnu + s con{ (u) 



(141) 

(142) 
(143) 



We note that this expression is quite general, in the sense that it is valid for any system in which the kinetic energy 
assumes the usual quadratic form. To proceed further, we need an explicit expression for the configurational entropy 
Sconf: something which is generally not easily feasible. 

Now we use the fact that the potential energy of the HMF model, as evident in (|128[) . is a very simple function of 
the microscopic magnetization m = (m x ,m y ), with a one to one correspondence between the value of the potential 
energy U and the modulus of the microscopic magnetization m 2 



+ niy. In fact, if we define 



Q m (m) 



j~\\d9 l 5 \ cos 9, - Nrn^j 8 ^ sin O^J , 



(144) 



we have that this function will be proportional to fl CO n{(U) for u = U/N = (1/2 — m 2 /2). The coefficient of 
proportionality will give a vanishing contribution to s con f(u) in the thermodynamic limit. We note that, as in the 
canonical case, there is a continuous degeneracy on the direction of the spontaneous magnetization; therefore, we do 
not lose generality by choosing the spontaneous magnetization in the direction of the x axis. The integral in (I144|) 
can be computed using the Fourier representation of the 5 function. We therefore have 



cos 9i — Nm 



( — J J dqi J dq 2 J d6i exp 

dq 2 exp jiV [-iq>im + In J Q {{q\ + ql)^) } > 
where Jo(z) is the Bessel function of order 

p2TT 

Jo(z) = / d6 exp (iz cos 9) . 
Jo 



exp 



K]2 



E 



sin ( 



2 poo poo 

dqi 

— OO J — OO 



(145) 
(146) 

(147) 



To solve the integral in Q145|) with the saddle point method, we have to consider qi and q 2 as complex variables. Using 
that the derivative of Jo is —Ji, the opposite of the Bessel function of order 1, the saddle point has to satisfy the 
following equations 



im - j-^l + <&) k ) 

1 / / 2 , 2 \ i \ 
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= 
= 0. 



Jo (<ii + <£)'> 

The solution of these equations is q 2 = and q\ = —ij, where 7 is the solution of the equation 

— r4 = fn . 



(148) 



(149) 



Here, we have used the properties Jo(iz) = Io(z) and J\(iz) = il\(z). Denoting by Bi nv the inverse function of I1/I0, 
we can write in the thermodynamic limit 



Sconf I 7: - \rn 2 ) = lim \- lnfi m (m) = -mB inv (m) + In I (B inv (m)) 
A I / JV^+oo 1\ 



(150) 



40 



We can now substitute (fT50T> in (fT43|) , using that u = 2(e — 1/2 + to 2 /2), and performing equivalcntly a maximization 
over m instead of that over u, to obtain 



s(e) 



- + - ln(27r) + - In 2 
2 2 1 ' 2 



sup 

m>mn 



1 ^ ( 1 ^ 1 2 
— me — m 

2 I 2 2 



mB inv (m) + In I (B inv (m)) 



(151) 



with m,Q 
equation 



sup[0, 1 — 2e]. The maximization problem over m is solved graphically, looking for the solutions of the 



B inv (m) = . 



(152) 



2e - 1 + m 2 

The graphical solution m = m(e) is shown in Fig. [T51 and it gives the following results. For < e < 3/4, the 
magnetization m(e) monotonically decreases from 1 to 0, while for e > 3/4 the solution is always m = 0. At e = 3/4, 
there is a second order phase transition, a first signature that the two ensembles give equivalent predictions. 




b) 







e = 0.6 







m 
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FIG. 18: Graphical solution of Eq. (|152[) for three values of the energy s, one above and two below the critical value e c = 3/4. 
In both plots the dotted curve is the function Bi nv (m), while the solid curve is the function of the first term in Eq. (|152p . This 
last function diverges for m 2 = 1 — 2e, which is between and 1 for < e < 1/2. The corresponding asymptote is the vertical 
dotted line in the left panel, a) e = 0.47: the relevant solution is the one with m > 0; b) e = 0.6 and e — 0.9: in the first case 
the relevant solution is the one with m > 0, while in the second case the only solution is m = 0. 



Indeed, it's easy to prove that s(e) is concave. First, we easily derive from ()151[) that the inverse temperature /3(e) 



is 



1 



(153) 



F m 2 (s) 

If /3(e) has a negative derivative, then s(e) is concave. The negativity of this derivative is trivial for e > 3/4, when 
m(e) = 0. For e < 3/4 we can proceed as follows. We note that, using the last equation, we can write (|152|) also as 



m 



Ji(m/3(e)) 



(154) 



7 (m/3(e)) 

We have just proved graphically that (|152p . and thus (|154[) . have a unique solution for each e, that decreases if e 
increases. Studying (|134|) in the canonical ensemble, we have found that this unique solution has the property that 
m decreases when /3 decreases. Therefore, in the present case an increase in e results in a decrease in (3(e), also when 
m(e) > 0. This proves the concavity of the entropy s(e) and therefore ensemble equivalence. The concavity property 
of the microcanonical entropy (|151j) ensures that it could also be computed by the Legendre-Fenchel transform of <f>, 
the rescalcd free energy computed in the canonical ensemble and reported in Eq. (|133| . 

We conclude this section devoted to the explicit calculation of the canonical and microcanonical solution of the HMF 
model by showing the plots of the relevant thermodynamic variables. In Fig. [19] we show the full dependence of the 
entropy on the energy (see Eq. (|15ip ) and the rescaled free energy versus the inverse temperature (see Eq. (|133[) h In 
Fig- HOI w e plot the caloric curve (see Eq. (|153p ) and the dependence of the order parameter on the energy, obtainable 
from Eq. 1(152)) . 
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FIG. 20: Inverse temperature versus energy (a) and magnetization versus energy (b) for the HMF model (|126[1 with J = 1. 
The dotted lines are traced at the phase transition point. 



4- Min-max procedure 

We have already emphasized that the microcanonical partition function is much more difficult to obtain than the 
canonical one. Although we have been able to obtain it by a direct counting for the BEG model, this procedure is 
applicable only for discrete variables. In this subsection we will show how the microcanonical entropy can be obtained 
when the canonical free energy has been derived through an optimization procedure of the type shown in (|133j) . At 
the same time, this discussion will allow us to understand how ensemble inequivalence can arise. However, it should 
be remarked that the deriva tion presented here is not rigorous and relies on strong assumptions. We will follow the 
argument presented in Ref. |225l |. and later generalized in Ref. [72^ . although we will give here a somewhat different 
proof of the result. 

It is crucial to assume that the canonical partition function, similarly to what has been done in Eq. (|132[) for the 
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HMF model, is given by the following integral 



+00 



dx exp 



(155) 



with <f)(P, x) a difFerentiablc function of (3 for (/3 > 0) and a dummy variable x. This is a crucial assumption, which is 
at the core of the applicability of the method. We do not explicitly give any clue to what is the nature and physical 
meaning of the variable x, but of course we have in mind the integration variable which appears in the Hubbard- 
Stratonovich transformation. Moreover, it should be observed that the variable x has nothing to do with the order 
parameter m. Even the variation range of these two variables is different: in principle the range of x is the full real 
axis and the integral in Eq. (|155p is required to converge (for example, in the Hubbard-Stratonovich trasformation 
there is a dependence of the integrand of the kind exp(— x 2 )). 
In the thermodynamic limit, we therefore have 



cj>(P) =inf</»(/3,x) 



(156) 



The canonical entropy is defined by the following variational principle, 



(157) 



where Is is defined in formula (jTTSJ) . This is nothing but the Legendre-Fenchel transform of 4>{p). We observe that 
this expression is valid also in the case in which e is such that there is no /3 value for which the derivative of </>(/?) is 
equal to e. In our case, we can insert Eq. (|156[) to obtain 



inf 

/3>0 



sup 



pe-<f>iP,x) 



(158) 



One should remark that this relation is valid only if the assumption made for the partition function in Eq. (|155|) is 
fulfilled. In particular the x-dependence of cf> cannot be neglected in order for the integral in Eq. (| 1 55|) to converge. 
The relation between s(e), <f>(P) and s ca „(e) is summarized in Fig. |2~T1 



LFT 



LFT 



s(e) 



FIG. 21: Relation between microcanonical entropy s(e), rescaled free energy <j)(P) an d canonical entropy s can (e). LFT indicates 
the Legendre-Fenchel transform. 



When ensembles are equivalent s can (e) = s(e). We will show here that a necessary and sufficient condition for the 
ensembles to be inequivalent is that s(e) is strictly smaller than s CQn (e). This is for instance what happens for the 
BEG model, for which s can (e) is the concave envelope of s(e), as shown in Fig. [T3l 

We will prove here that, once the function <j)(P,x) is known, one can introduce its Legendre-Fenchel transform 



s(e,x)=inf j3e — <j>(fi, x) 



and obtain microcanonical entropy by the following formula 



s(e) = sup[s(e, x)] = sup 



inf 

/3>0 



(159) 



(160) 



The only difference in expressions (| 158[) and (|160p is just the order in which the minimum with respect to /3 and 
the maximum with respect to x are taken. Although this might seem a detail, it determines a different result (see 
Appendix A, where this is proven in full generality). It turns out that 



s(e) < s can (e) 



(161) 



43 



Let us remark that the entropy defined in formula (| 1 5Q[) docs not coincide with s(e, m). In particular, one can easily 
verify that s(e,x) is always concave in e for all x values, since it is obtained from a Legendre-Fenchel transform. On 
the contrary, s(e, m) can be non concave in e. 

Let us briefly sketch the proof of formula (|160[) . which is based on the analysis of microcanonical partition function. 
Using the Laplace representation of the Dirac delta function, this latter can be expressed as 

Q(E,N) = Yl SiE-HdSt})) (162) 
{Si,...,S N } 

= T~ dA E CX P[ A ^ - H ({Si})] (163) 

0-\-ioo 

dXZ(X,N)cxp(XE) (164) 

0— too 



1 

2nd 



where (3 = Re(A) > is the inverse temperature. We use A, instead of (3, as an integration variable, because we are 
considering the analytical continuation of Z(X,N) to the complex plane. The last integral cannot be solved, in the 
thermodynamic limit, using the saddle point method because, after expressing Z(J3, N) ~ exp[— N(3f(P)], the function 
(3f(/3) is not in general diffcrcntiablc for all (3. In spite of this, we can heuristically argue that the integral will be 
dominated by the value of the integrand at a real value of A (A = (3 + iAj), otherwise we would obtain an oscillatory 
behavior of f2(£7, N), giving rise to negative values, which are impossible for the density of states. To have a proof of 
this we can proceed as follows. 

We are assuming to be in cases where Z(/3, N), for real /3, can be expressed as in (|155|) . with <p analytic. Then, this 
integral representation will be valid, in the complex A plane, for at least a strip that includes the real axis, let's say 
for | A/ 1 < A, with A > 0. 

We now divide the integral in (|164p in three intervals, defined by Aj < —5, —5<Xj<5 and A/ > 5, respectively, 
with < S < A. In Appendix B, we show that the contribution to the integral in A coming from values of Aj 
outside the strip, i.e. for values of Xj with |A/| > A, is exponentially small in N. Therefore the calculation of the 
microcanonical partition function reduces to performing the following integral 

il(E,N) = / dXe NXs Z(X,N) = / dx dA exp ( N[Xe - 4>(X,x)} ) , (165) 

2m Jp_ iS 2m J_ OQ J p_ iS \ J 

where < 5 < A, and where, in the second equality, we have used the fact that inside the strip |A/| < A we can 
represent Z{(3,N) as in (|155|) . Since <p is analytic in all the domain of integration, we can perform the integral in A 
in the large N limit using the saddle point method. It can be shown that for each value of x the real part of the 
argument of the exponential in Q165P is larger if computed on the real axis A/ = 0. Indeed, we have 

dx cxp{N[Xe-4>(X,x)}} = cxp(NXe)Z(X,N) (166) 

■oo 

= exp(iVAe) ]T exp{-/?#({&}) - iXiH ({Si})} (167) 

{Si,...,Sjv} 

= exp(Nf3e) ^ exp{-/3H({5i}) - iX r H({Si}) + iAjiVe} (168) 

{Si,...,S N } 

= e X p(Nps)(exp(iXi[NE-H({Si})])}Z(P,N) , (169) 

where in the last expression the average (•) is performed with Boltzmann weight exp[/3H({Si})]. Using now the 
definition Z(f3,N) = J dxexp[—N(j)(f3,x)}, one can rewrite Q(E,N) as 

Q(E,N) = / dx d\exp(-N(j>(/3,x)+N/3£ + ln(exp{iX I [Ne-H({Si})]))) . (170) 

2m J_ OQ J p _ iS \ J 

The real part of the exponent in the integral is obtained by replacing ln((-)) with ln(|(-)|). The maximum of this 
logarithm is obtained for A/ = 0, since for all other nonzero values of A/, when performing the average in (•), one 
would sum unit vectors exp(iA/a) with different values of the phase a, depending on the specific configuration {Si}, 
obtaining a result which will have certainly a smaller modulus than summing all the vectors in phase with A/ = 0. 
Therefore, if there is a saddle point on the real axis, it will certainly give a larger real part for the argument of 
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the exponential than other saddle points eventually present outside the real axis. Moreover, we have just proven 
that the saddle is necessarily a maximum along the imaginary A direction. Therefore, from the general properties of 
holomorphic functions, it will be a minimum along the (3 = Re(A) axis. Once the integral over A is performed, the 
remaining integral over x can be computed using again the saddle point method, now for a real function, which gives 
a maximum over x. Combining all this, one gets the formula for the microcanonical entropy (|160[) . 



Another way of arguing 2251 ] is to remark that, since 0(A, x) is obtained by analytically continuing a real function, 
if saddle points are present in the complex A plane, they necessarily appear in complex conjugate pairs. This would 
induce oscillations in the values of £l(E, N) as a function of N, which would imply that Sl(E, N) could even take 
negative values. This is absurd and leads to exclude the presence of saddle points out of the real A axis. 

We have already remarked that ensemble inequivalence is a consequence of inequality (|16ip . which in turn derives 
from the different order of the minimum in (3 with respect to the maximum in x in expressions (| 1 58|) and (|160[) . Let us 
now study in more detail the extrema defined by the two different variational problems. The first order stationarity 
conditions are 

% - • <-> 
I - • ■ <-> 

which are of course the same for the two ensembles. However, the stability conditions deriving from the two problems 
are different. For what concerns the canonical entropy s can (e), we have the conditions 



d 2 4> d 2 4> 
d(3 2 dx 2 

while for the microcanonical entropy s(e) we have 

d 2 6 d 2 6 




d(3 2 dx 2 




(173) 
(174) 



< (175) 

< 0. (176) 



A necessary and sufficient condition to satisfy (JT73J), (fT74")l . (fT75|) and (fT76")) is that d 2 (p/d/3 2 < and d 2 (f)/dm 2 > 0. 
However, since the conditions are different in the two ensembles, one can find values of (3 and x that correspond to 
stable states in one ensemble but are unstable in the other. It should be noted that one could find more than one 
stationary point in a given ensemble, satisfying the corresponding stability condition. Obviously, in this case one has 
to choose the global extremum. If the global stable extrema are different in the two ensembles, then we have ensemble 
inequivalence. Tightly linked to stability is the sign of specific heat. Indeed, using the expression 

cv = -P gp2 > ( 17? ) 

where x{(3) is obtained by solving (|171[) and (|172[) . one can obtain an expression for the specific heat which is valid in 
both the ensembles 

d 2 4>d 2 4> ( d 2 4>\ 2 



dp 2 dx 2 \ dfidx j 

°v = -P 2 s~ — ■ ( 178 ) 

a 4> 

dx 2 

We see from (| 1 73|) and (|174[) that this expression is positive in the canonical ensemble. However, in the microcanonical 
ensemble the conditions (|175p and (|176p do not determine the sign of d 2 (f>/dx 2 , and thus the specific heat can have 
cither sign in the microcanonical ensemble. 
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Let us check that, using the method described in this subsection, we can rederive the microcanonical entropy of 
the HMF model, Eq. (T5T]) . For the HMF model, the function 4>((3,x) can be obtained from fTggP 

<M/3, x) = | - \ ln27r + l - hi/? + ^- - \nl (px) . (179) 
The stationarity conditions (|171[) and (|172|) read in this case 

S-*-"5^-°- <181) 

Inserting the second equation in the first, we find that = 2e — 1 + x . Substituting back in the second equation, 
and using the function B inv , the inverse function of previously defined, we have 

B in v(x) - 2£ _l + x 2 = . ( 182 ) 

which is identical to Eq. (|152[) . The computation of the second derivatives gives 

d 2 4> _ J_ _ d_ fh{Px) 
d/3 2 ~ ~W ~ X dx~ \I ((3x) 



2 - I -Pnd I < » (183) 



dx 2 dx \Iq((3x) 

These inequalities are both satisfied at the stationary points determined by (|180[> and (|18ip . Actually, Eq. (|183[) is 
identically satisfied, since the derivative of Ii/Iq is positive definite. This confirms ensemble equivalence for the HMF 
model. 

Finally, Eqs. (TT55)) and p^D|) give 

s(e) = s can (e) = \ + \ 1h(2tt) + \ In 2 + I In (e - X - + ±x< 

x 2 



ln ^o Tr—r—, , (185) 



2e-l + x 2 \2e-l + x 2 

with x satisfying Eq. (|182[) . We thus obtain an expression identical to (|15ip . taking into account Eq. I|152|) . 



E. The large deviation method and its applications 



The mathematical theory of large deviations is a field in its own, and obviously it is outside the scope of this 
review to give details of this theory and to treat at the level of mathematical rigor what will be presented about 
it. The methods based on large deviation theory and their applications to probl ems i n statisti cal mechanics have 
been popularized among theoretical physicists in several books and review papers fl32l . OH We would like to 

cite here a few interesting wo rks i n which physical systems with long-range interactions have been studied using this 
method. Michel and Robert |243| successfully used large deviations techniq ues to rigorou sly prove the app licability 
of statistical mechanics to two-dimensio nal fl uid mechanics, proposed earlier [2451 . 12751 . |276| |. Ellis et al. jl4l| pursued 
the approach of Robert and Sommcria [276] to solve two-dimensional geophysical systems. 

Many particle systems with long-range interactions often offer a relatively simple field of application of the theory 
of large deviations. This is very interesting, especially if one considers that the outcome of the calculation is the 
entropy function. 

The structure of this section is the following. In subsection IIVE II where we introduce the method, we will show 
how to obtain the entropy for a class of systems in which the Hamiltonian can be expressed in a way that is often 
realized in long-range systems. In subsection IIV E 2l we will show the simple application to the 3-states Potts model, 
where the result can be compared with the direct computation of the entropy. This simple discrete system already 
shows ensemble inequivalence. In subsection IIV E 3l we will treat again the HMF model to introduce the application 
of the method to systems with continuous variables. Interesting applications will be treated in subsections IIV E 4l and 
IIV E 51 both devoted to systems presenting ensemble inequivalence: a generalized HMF model and the so called (f> 4 
model. Finally, in subsection IIV E 6l we will present the equilibrium solution of the Colson-Bonifacio model of a linear 
Free Electron Laser. 



40 



1. The computation of the entropy for long-range systems 



Denoting again collectively with x = {{Pi},{qi}) the phase space variables of a Harriiltonian system, let us sup- 
pose that the energy per particle e(x) = H(x)/N can be expressed as a function of (few) global "mean-fields" 
Hi(x), . . . , fx n (x); i.e., we suppose that it is possible to write 

e(x)=e(fj, 1 (x),...,fji n (x)). (186) 

This is a situation often realized in long-range systems. Actually, one could require that formula (|186p be valid only 
asymptotically for N — > oo, while for large but finite N the right hand side could contain a remainder R(x) that 
can be neglected in the thermodynamic limit. However, in mean-field systems representation (|186[) is exact for all 
N . With the specification of the microscopic configuration x, we define what is generally indicated as a microstate of 
the system. On the contrary, specifying that the system is in a state in which the global variables have given values 
fii, . . . , we arc defining what is called a macrostate of the system. Once the macrostate is chosen, the microscopic 
configuration is not determined, since all x that satisfy fJ-k{x) = Hk for k = 1, . . . ,n belong to the same macrostate. 

The identification, in a concrete system, of the global variables is the first step in the application of the large 
deviation method. 

The second step will be the computation of the entropy of the different macrostates, i.e., the calculation of the 
function 

s(/ii, . . . ,/!„) = lim —]n I dx5(fj,i(x) - fix) .. .8(n„(x) - f^n) . (187) 

N^oo iv J 

Leaving aside for the moment the problem of computing s(fii, . . . ,/i„), that at first sight does not seem to be any 
simpler than computing the entropy function s(e), it is easy to see how this last function can be obtainable from 
s(fii, . . . , /i„). In fact, we have 



dx6[N(s(x)-s)} = I dx5[N(e(m(x),...,(j, n (x)) - e)} (188) 

dxdni . . .d/i„<5(^i(.x) - /ii) . ..S(fj, n (x) - /i„)<5 [N(e(ni, . . . ,/i„) - e)] (189) 
d^i . . . d^„ exp [Nsim , . . . ,//„)] 6 [JV(e(/xi, . ..,fj, n )- e)] . (190) 



We therefore have 



s(e) = sup s(m, . . . , fj, n ) . (191) 

[Mi,—,Mn|c(/ii,...,/i„)=e] 

The solution of this extremal problem constitutes the third and final step of the computation. We are left with the 
problem of computing s(/zi, . . . ,/i n ), i.e., with the actual implementation of the second step. Let us introduce the 
following canonical partition function 

Z(X u ...,X n ) = J dx eiq ? [-N(\ 1 fj, 1 {x) + --- + X n fj, n (x))} . (192) 

Few steps completely analogous to those relating 4>{0) = Pf(f3) to s(e) show that the free energy associated with 
Z(Xi, . . . , A„) is given by the (multi-dimensional) Legendre-Fenchel transform of s(pi, ■ • ■ , fJ-n)- 

(j>(\l,...,\n) = - lim — lnZ(Xi,...,X„) = inf [Xifii H h A„/i„ - s(/ii, . . . , fi n )] . (193) 

N~*00 1\ (Ltl,...,/i n 

We know that in general s(e) is not concave and it cannot be obtained by the Legendre-Fenchel transform of 4>(/3). We 
would expect the same difficulty in the inversion of ()193|) . However, if it happens that 4>(Xi, . . . , X n ) is differentiable 
for real A (see Appendix C), then we are guaranteed that s(/l*i, . . . , /x„) can be obtained by the inversion of Q193P and 
that it is therefore concave [l44j 

s(pi, . . . ,fi n ) = inf [Ai/iH h X„ji n - 0(Ai, . . . , A„)] . (194) 

Ai ,,\ n 

Obviously the practical usefulness of this method resides in the fact that generally, even in the presence of phase 
transitions, the function <f>(Xi, . . . , A„) happens to be differentiable. In Appendix C, we present a brief proof of the 
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differentiability of <fi(\i, ... ,X n ) and of the validity of (| 1 94[) for the cases in which the global variables Hk(x) are sums 
of one-particle functions. 

Finally, it is not difficult to sec how <fi(f3) is related to s(/ii, . . . , fx n ). Again, with steps analogous to those linking 
</>(/?) to s(e), we find that (j>(/3) is given by the following extremal problem 

<f>(0) = pf[fi) = inf [pe(fi u . . . , fx n ) - s(/i 1; . . . , fx n )} . (195) 

(Ul,...,/i„ 

The two variational problems (|191|) and (|195p express the microcanonical entropy and the canonical free energy as a 
function of s(fJ>i, . . . , fJ-n) and of the energy function e(pi, . . . , fi n ), and offer a tool to study ensemble equivalence in 
concrete systems. 

It is important to emphasize that the large deviation method allows to reduce the statistical mechanics study of a 
model to an optimization problem. The method of global variables reduces exactly (or sometimes approximately) the 
search of the equilibrium solution to a variational problem. This approach often drastically simplifies the derivation 
of the statistical mechanics properties and is not very well known among physicists. 

However, such a procedure does not apply to all long-range interacting systems. In particular, those for which 
statistical mechanics cannot be reduced (even approximately) to a mean-field variational problem are excluded. As 
shown, the method is strongly dependent on the possibility to introduce global or coarse-grained variables (examples 
are the averaged magnetization, the total kinetic energy, etc.) such that the Hamiltonian can be expressed as a 
function of these variables, as in Eq. (|186p . modulo a remaining term R(x) whose relative contribution vanishes in the 
thermodynamic limit. If also short range interactions are present the procedure will not be possible. One might still 
be able to express the Hamiltonian as a function of coarse-grained variables plus a rest, but the rest will not vanish 
in the thermodynamic limit. 

The global variables {/z^} could be given by fields. For instance, they could correspond to a local mass density 
in a gravitational system, or a coarse-grained vorticity density in 2D turbulence. In these cases they would be 
mathematic ally infinite dim ensional variables. However, with natural extensions, the steps that we have showed can 
be repeated flH [H3l l243j . 

Finally, we note that we could be interested in entropy functions depending not only on the energy density s, but 
also on other quantities, see e.g. Eq. (|9"5)) for the dependence on magnetization. In this case, also these additional 
quantities should be expressed as functions of the global variables Hi, . . . , fx n (e.g. m(x) = m(/ii, . . . , /in)), while the 
variational problem (|191[) will be constrained to fixed values of e(ni, . . . , = e and m(/ii, . . . , //„) = m. Examples 
of this sort will be discussed in the following. 



2. The three-states Potts model: an illustration of the method 

Here, we apply the large deviation method to the three-state Potts model with infinite range interactions [37| . 
This si mple example has been used as a toy model to illustrate peculiar thermodynamic properties of long-range 
systems jl88| . The diluted three-state Potts mod el wi th short-range interactions has also been studied in connection 
with "Small" systems thermodynamics by Gross [l68j . 

The Hamiltonian of the three-state Potts model is 

/ N 

H ^ = -2nH S ^ ■ (196) 

Each lattice site i is occupied by a spin variable Si, which assumes three different states a, b, or c. A pair of spins 
gives a ferromagnetic contribution — J ( J > 0) to the total energy if they are in the same state, and no contribution 
otherwise. It is important to stress that the energy sum is extended over all pairs the interaction is infinite 

range. 

Let us apply the method, following the three steps described in the following. The first step of the method consists 
in associating, to every microscopic configuration x, global (coarse-grained) variables, such that the Hamiltonian can 
be expressed as a function of them. For Hamiltonian Q196P the appropriate global variables are 

H = {n a , fi b ) = (n , rib) , (197) 

where (n a , rib) are the fractions of spins in the two different states a, b. As a function of the microscopic configuration, 
we have 



1 N 



N 
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with k — a, b. The Hamiltonian is expressed in terms of the global variables as 

e{n a ,n b ) = -^[n 2 a + n 2 + (l-n a -n b ) 2 ] , (199) 

which is exact for any N. 

The second step is the computation of s(/xi,/X2) According to what we have shown in the previous subsection, we 
have to compute first the partition function 

(N N \ 

-Kj2 S s„a-^Y, 6 s,-b) , (200) 
t=l i=l / 

where the integral over the configurations is a discrete sum in this case. The last expression is easily solved, to find 

Z(X ai X b ) = (e~ K +e- A " + 1) N . (201) 

We therefore obtain 

<KA a , A 6 ) = - In (e- A » + e~ x » + l) . (202) 

As this function is evidently analytic, we can compute the function s by a Legendre-Fenchel transform. The entropy 
function s(n ai n b ) will then be 

s{n a ,n b ) = inf [X a n a + X b n b + In (e~ A » + e~ x » + l)] , (203) 

which is easily solved to get 

s(n ai n b ) = —n a hxn a - n b \nn b - (1 - n a - n b ) In (1 - n a - n b ) . (204) 

We now proceed to the third and final step of the calculation of the entropy s(e). The variational problem Q19ip 
becomes 

s(e) = sup (—n a In n a -n b Inn* - (1 —n a —n b ) ln(l - n a — n b ) -— (n 2 . + n b + (1 - n a - n b ) 2 ) = e) . (205) 

As we have anticipated, this expression could be obtained by direct counting. The variational problem (|205p can be 
solved numerically. The microcanonical inverse temperatu re ca n then be derived and it is shown in Fig. [2U in the 
allowed energy range [— J/2,— J/6]. Ispolatov and Cohen [l88| ] have obtained the same result by determining the 
density of states. A negative specific heat region appears in the energy range [—0.215 J, — J/6]. 
Let us now consider the canonical ensemble. Applying Eq. (|195|) to Hamiltonian (|196[) . we have 

(f>((3) = inf (n a In n a + n b In n b + (1 - n a - n b ) ln(l - n a - n b ) - (n 2 a + nl + (1 - n a - n b ) 2 )) . (206) 

n a ,n b \ Z J 

To obtain the caloric curve, one has to compute e = d<f>/d/3. Figure [22] shows that at the canonical transition inverse 
temperature j3 ~ 2.75, corresponding to the energy s / J ~ —0.255, a first order phase transition appears, with an 
associated latent heat. The low energy "magnetized" phase becomes unstable, while the high energy "homogeneous" 
phase, which has the constant energy density, e/J = —1/6, is stabilized. In Fig. [22j the two dotted regions have 
the same area, respecting Maxwell's construction. At the inverse transition temperature, there is also a jump in the 
global variables (n a , n b , 1 — n a — n b ), which are the order parameters of the model. 

This extremely simple example shows already ensemble inequivalence. In the microcanonical ensemble, there is no 
phase transition and the specific heat becomes negative. On the other hand, in the canonical ensemble, there is a first 
order phase transition with a latent heat. The caloric curves do not coincide. We observe that in the energy range of 
ensemble inequivalence, microcanonical temperatures do not coincide with any canonical one. 



3. The HMF model: dealing with continuous variables 

In this subsection, we consider again the HMF model to show the implementation of the large deviation method 
to a system with continuous variables. Let us write again here Hamiltonian (|128p using the components of the 
magnetization, m x and m y , defined in Q127P explicitly evidenced: 



N 



2 2 
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FIG. 22: Caloric curve (inverse temperature vs. energy density) for the three-states infinite range Potts model. The canonical 
solution is represented by a solid line. The microcanonical solution coincides with the canonical one for e < e t and is instead 
indicated by the dash-dotted line for s < e < — J/6. The increasing part of the microcanonical dash-dotted line corresponds 
to a negative specific heat region. In the canonical ensemble, the model displays a first order phase transition at /3*. The 
two dotted regions bounded by the dashed line and by the microcanonical dash-dotted line have the same area (Maxwell's 
construction). 



By a direct inspection of Hamiltonian (|207[) . one can identify the global variables u = pf, m x and m y . Since 

v = X) i Pii the- average momentum, is a conserved quantity with respect to the dynamics defined by the Hamiltonian, 
it is convenient to included it among the global variables. Therefore, we will compute the entropy function s(e,v). 
The first step of the procedure therefore consists in the identification of the following global variables 



H = (u,v,m x ,m y ) . 



(208) 



We note that also in this case the expression of the Hamiltonian as a function of the three global variables u, m x and 



m y is exact for each TV, since we have 



e(u,m x ,m y ) = - (u + 1 



2 2\ 

m x - m y ) 



(209) 



without a reminder. Let us now go to the second step, i.e. the computation of s(u,v,m x ,m y ). We know, by now, 
that first we have to compute the partition function 



Z(X U , A„, X x , \) — 
which is solved to get 



. / \ / N N N N \ 

/ II dOidpi J exp I -X u y^Pj ~ A„ - cos ® l ~ z)2 sin ® l ) ' 

J \ i / V i=l i=l i=l i=l / 



Z (A u , X v , X x , Xy) 



i — l N 



(210) 



(211) 



where Iq is the modified Bcsscl function of order 0. We note that the existence of the integral in (|210[) requires that 
A„ > 0. Then, we have 



,Xy) = - ^hi7r+ ilnA u - ]nl ^ XI + A;)) 



4A M 2 

The analyticity of this function allows us to write that 



(212) 



s(u, v 1 m x , rriy) = inf 

lAu ,X X , A,, 



A 



1 



X u u + X v v + X x m x + Xyiriy + -y- + - Imr 

4A 7/ Z 



-ill /A2+A2 



(213) 
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This variational problem can be solved explicitly, and this can be done for the "kinetic" subspace (A u , X v ) separately 
from the "configurational" one (A ;c ,Ay), giving 

s(u,v,m x ,m y ) = s kin (u,v) + s 

conf \JW*x 1 

m y ) . (214) 
We note in addition that s con f(m x ,m y ) does not depend on m x and m y separately, but on the modulus to = 



m x + m y . This was expected, since, as already emphasized, we have degeneracy with respect to the direction of 

the spontaneous magnetization. Using the function Bi nv previously introduced, i.e., the inverse function of Ix/Io, we 
obtain 



Skin(u,v) = i + - hi7r + i ln2(u - v 2 ) 
Sconf(m) = -mB inv (m) + ]nI (B inv (m)) 

Let us remark that Cauchy-Schwarz inequality implies that u > v 2 . 
The third step of the procedure gives the entropy function 



s(e,v) 



sup 

u.m 



s(u, v, to) 



u 1 
2 + 2 



= £ 



sup 

u.m 



(u, v) + s con f(m) 



1 m z 



where in the last equality m satisfies the equation 

m 



2 2 2 

+ ^ ln(47r) + ^ In ( e - ^ + ^m 2 - ^v 2 ) - mB v 



B^v (m) = . 



(215) 
(216) 



(217) 

(218) 
(219) 

(220) 



This function depends on energy and on momentum. Maximizing with respect to v, we obtain s(e). It is easy to find 
that this maximum is obtained when v = for each e, and that the entropy s(e) is given by 



«(e) = - + -ln(47r) 



1 



In e - 



2e - 1 + m 2 



In Jo 



1 1 

2 + 2 ? 



2e-l + m? 



(221) 



with m satisfying Eq. 



taken at v = 0. We have therefore recovered the previous expressions, i.e., Eq. (|151[) with 



Eq. |T52|) or Eq. Ipg]) with Eq. (p2]l . 

For completeness, we also derive the rescaled free energy <j)([3) = /3/(/3). Applying Eq. ()195|) . taking into account 
(f209|) . wc get 



4>{f3) = pf{0) = inf 

u,v,r> 

This variational problem can be easily solved to get (|133p 



. u 1 In 

B[ — m 2 

1 2 2 2 



s(u, w, to) 



(222) 



^. A generalized HMF model: ergodicity breaking 
A generalized HMF model has been recently introduced with Hamiltonian [62|, [68 



where 



% = E| + t/, (223) 



-^-to 2 - K^- ) 



with ?7i = yTO^ + m^, being m x ,m y defined in formula (|127p . Models of this kind have been used to describe the 
physics of nematic liquid crystals |22l| 



■51 



a. Statistical mechanics Let us now study the statistical mechanics of model f|223[) with J, K > using large 
deviation techniques, in a similar way as it was done for the HMF model in subsection IIV E 31 Like the HMF model 
this model has four global variables 

ji = (m x ,m y ,u,v) (225) 

and its microcanonical entropy is given by the same formula (|2 1 T[) as for the HMF model. However, the consistency 
equation (|220p is replaced by the following 

Jm + Km 3 

BM = 2E + Jm* + Km*l2-tf ' (226) 

One solves analytically this relation for second order phase transition and numerically for first order phase transition. 
Taking then into account the standard relation between inverse temperature and energy, one gets the phase diagram 
and the caloric curves discussed below. Similarly to the HMF model, one can also get the rescaled canonical free 
energy 



<j){/3) = inf 



^— + -tL- ln(27r/ [/3( Jm + Km 3 )}) 



(227) 



In Fig. [23] we report the phase diagram of model (|223[) in the parameter space (T/J,K/J) in both the canonical 
and the microcanonical ensemble. As expected (42l |. the two phase diagrams differ in the region where the canonical 
transition line is first order. The second order phase transition lines at T/J = 1/2 coincide up to the canonical 
tricritical point, located at K/J = 1/2, but then the predictions of the two ensembles differ: while the canonical 
ensemble gives a first order phase transition, in the microcanonical ensemble the critical line at T/J = 1/2 persists up 
to the microcanonical tricritical point at K/J — 5/2. Even more striking is the difference between the two ensembles 
for K/J > 5/2. Here, two different temperatures coexist at the transition energy in the microcanonical ensemble, 
showing what we have called a temperature jump. This corresponds to a microcanonical first order phase transition, 
whose "phase coexistence" region is shown by the shaded area in Fig. [23] As for the order parameter m, the most 
impressive difference between the two ensembles is in the intermediate region between the canonical first order phase 
transition line and the upper microcanonical first order line, where the canonical ensemble predicts a non vanishing 
order parameter m can =/= 0, while the microcanonical ensemble gives m m i C = 0. 
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m ca „ = 




(Canonical tricritical point 
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FIG. 23: Phase diagram of model (I223|l . The canonical second order transition line (solid line at T/J = 1/2) becomes first 
order (dotted line, determined numerically) at the canonical tricritical point. The microcanonical second order transition 
line coincides with the canonical one below K/J = 1/2 but extends further right to the microcanonical tricritical point at 
K/J = 5/2. At this latter point, the transition line bifurcates in two first order microcanonical lines, corresponding to a 
temperature jump. The behavior of the order parameter in the two ensembles is also shown in the figure, to highlight the 
striking difference in the predictions of the two ensembles. 

Figure [24] displays temperature vs. energy, so-called caloric curve, in the region where both the canonical and 
the microcanonical ensemble predict a first order phase transition. However, the microcanonical ensemble gives two 
temperatures at the transition energy. Both a region of negative specific heat and a temperature jump are present 
in the microcanonical ensemble. The specific heat is always positive in the canonical ensemble and latent heat is 
released. In a microcanonical simulation performed with N = 100 particles (shown by the points in Fig. I24p . the 
temperature jump is smoothed by the finite size effects and negative specific heat is well reproduced. The agreement 
with microcanonical thermodynamics predictions is very good. 
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FIG. 24: Caloric curve T(e) at K/J = 10. The microcanonical ensemble (solid line) predicts a region of negative specific heat, 
where temperature T decreases as the energy is increased. Moreover, a temperature jump is present at the transition energy. In 
the canonical ensemble, we have a first order phase transition (dashed line). The points are the result of a molecular dynamics 
simulation performed by solving numerically the equations of motion given by Hamiltonian (|223p with N = 100. 

We have shown how a simple generalization of the HMF model can give rise to all the features of ensemble 
inequivalence displayed by the BEG model, i.e. negative specific heat and temperature jumps in the microcanonical 
ensemble. A novel characteristic of this model with respect to the BEG model is the fact that the variables take 
continuous values. Moreover, the model possess a true Hamiltonian dynamics which allows us to confirm the theoretical 
predictions obtained in the thermodynamic limit also for finite N and to study non equilibrium features. 

b. Parameter space convexity We here discuss a concrete example where we can show that the space of ther- 
modynamic parameters is not convex, as already discussed in Sec. Ill CI Historically, this phenomenon has been first 
observed for a spin chain wi th a symmetric coupling [55l . Ifjfjl . l87j , and for an Ising model with both nearest neighbour 
and mean-field interactions |248j |. see Sec. IVI ATI However, it has been later realized that this occurs more generally, 
and even in the simple model studied here (|223p . We will consider the case J = — 1 and K > for which the m 2 
term in the Hamiltonian (|223|) is antiferromagnetic. We will study the structure of the set of accessible states in the 
space of thermodynamic parameters in the microcanonical ensemble. 

Intuitively, we expect that, for large values of K, the system is ferromagnetic while, for small values of K, the 
antiferromagnetic coupling will dominate and makes the system paramagnetic. As wc shall demonstrate below, there 
exists a range of values of the parameter K for which the model exhibits a first order microcanonical phase transitions 
between a paramagnetic phase at high energies and a ferromagnetic phase at low energies. In both phases there 
are regions in the (e, K) plane in which the accessible magnetization interval exhibits a gap, resulting in breaking of 
crgodicity. 

The specific kinetic energy u/2 = e — W(m) is by definition a non negative quantity, which implies that 

£ > W{m) = m 2 /2 - Km 4 /4,. (228) 

We will show that as a result of this condition not all the values of the magnetization m are attainable in a certain 
region in the (e, K) plane; a disconnected magnetization domain is indeed a typical case. As explained below, this 
situation is the one of interest. Let us characterize the accessible domains in the (e, K) plane more precisely by 
analyzing the different values of K (all this will become clear after looking at Fig. |2"5|) . 

For K < 1, the local maximum of the potential energy W is not located inside the magnetization interval [0, 1] 
(see Fig. 125b ) . The potential being a strictly increasing function of the magnetization, the maximum is reached at 
the extremum m = 1. The complete interval [0, 1] is thus accessible, provided the energy e is larger than W(l): the 
corresponding domain is in Rl defined and illustrated in Figs. 1231 and |2"6"1 The horizontally shaded region is forbidden, 
since the energy is lower than the minimum of the potential energy VF(0) = 0. Finally, the intermediate region 
< e < W(l) defines region R2: it is important to emphasize that only the interval [0, to_ (e, K)}, where m± (e, K) = 
[(1 ± VI — 4:eK)/ 'K] 1 ' 2 , is accessible. Larger magnetization values correspond to a potential energy W(m) larger than 
the energy density e, which is impossible. Figure [2"S"h also displays the value m_ corresponding to an energy in the 
intermediate region R2. 

For K > 1, the specific potential energy W has a maximum W max = l/4i^ which is reached at < TO max = 1/VK < 
1. Figures [25b and l25h . where the potential energy per particle defined in Eq (|224|) is plotted vs magnetization m, 
display such cases. For an energy e larger than the critical value W max , condition (|228p is satisfied for any value 
of the magnetization m. The complete interval [0, 1] is thus accessible for the magnetization m. This region is Rl 
represented in Fig. [26] 
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FIG. 25: Specific potential energy W vs. magnetization m for three different cases: K < 1 (panel a), 1 < K < 2 (panel b) and 
2 < K (panel c). The location of the maximal magnetization m ma x and the corresponding potential energy VKmax are shown 
(see text). In panels (a) and (c), two examples of the location of the critical magnetization m±(e,K) is indicated for energy 
density values e in the intermediate regions. 
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FIG. 26: The (e, K) plane is divided in several regions. The solid curve corresponds to K = l/(4e), the oblique dashed line to 
K = 2 — 4e, while the dotted one to K = 1. The vertically shaded, quadrilled, and horizontally shaded regions are forbidden. 
The accessible magnetization interval in each of the four regions is indicated (see text for details). 

Let us now consider the cases for which e < W max . As discussed above, the minimum W m i n of the potential energy 
is also important to distinguish between the different regions. For 1 < K < 2 (see Fig. 125b). the minimum of W(m) 
corresponds to the non-magnetic phase m = where W^O) = 0. The quadrilled region shown in Fig. 1261 which 
corresponds to negative energy values, is thus not accessible. On the contrary, positive energy values are possible 
and correspond to very interesting cases, since only sub-intervals of the complete magnetization interval [0, 1] are 
accessible. There are however two different cases 

• for < W(l) < e < W maK , the domain of possible magnetizations is [0,m_ (s,K)] U [m + (e 7 K) , 1]. The above 
conditions are satisfied in R3 of Fig. [26] 

• for < e < W(l), only the interval [0, m_ (e, K)] satisfies condition (|228|) . This takes place within R2 of Fig. EH 

For the domain 2 < K, the minimum of the potential energy is attained at the extremum, m = 1, implying 
e > W(l) = 1/2 — K/A. The vertically shaded region is thus forbidden. In the accessible region, two cases can be 
identified 

• for W-^l) < e < 0, only the interval [m+ (e, K) , 1] satisfies condition ()228[) . It is important to note that 
TO+ (e, K) < 1 provided e > 1/2 — K/A. These cases correspond to region R4- 

• for < e < Wmax, the two intervals [0,m_ (e, K )] and [m+ (e,K) , 1] satisfy condition (]228[) . corresponding to 
R3 of Fig. M 

In summary, the complete magnetization interval [0, 1] is accessible only in the region Rl. In R2, only [0,to_] is 
accessible, while only [m + , 1] is accessible in R4- Finally, we note that the phase space of the system is not connected 
in the region R3. Indeed, the magnetization cannot vary continuously from the first interval [0, m_] to the second one 
[?77 + , 1], although both are accessible. These restrictions yield the accessible magnetization domain shown in Fig. |2"T1 
The fact that for a given energy the space of the thermodynamic parameter m is disconnected implies crgodicity 
breaking for the Hamiltonian dynamics. It is important to emphasize that the discussion above is independent of the 
number of particles and ergodicity is expected to be broken even for a finite N . 

c. Phase diagram in the microcanonical ensemble We have thus found that in certain regions in the (e, K) plane, 
the magnetization cannot assume any value in the interval [0, 1]. For a given energy there exists a gap in this interval 
to which no microscopic configuration can be associated. Let us now study the phase diagram in the microcanonical 
ensemble, i.e. in the parameter space (e,K). 

First, by comparing the low and high energy regimes, it is possible to show that a phase transition is present 
between the two regimes. In the domain R4 of Fig. [23 for very low energy e (close to the limiting value 1/2 — K/A), 
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FIG. 27: Accessible region in the (m, e) plane for K — 3. For energies in a certain range, a gap in the accessible magnetization 
values is present and defined by the two boundaries m±(e,K). 

the accessible range for m is a small interval located close to m = 1 (sec Fig. 125b ) . The maximum in m of the 
entropy s(e, m) corresponds therefore to a magnetized state located very close to to = 1 (see the top-left inset in 
Fig. I28[) . On the contrary, in the very large energy domain (e 3> m ~ 1), the variations of the entropy with respect 
to m are dominated by the variations of the configurational entropy s con f, since the kinetic entropy 



is roughly a constant s ~ (lne)/2 when m is of order one. As expected, the configurational entropy is a decreasing 
function of the magnetization: the number of microstates corresponding to a paramagnetic macrostate being much 
larger than the same number for a ferromagnetic state. The configurational entropy has therefore a single maximum 
located at m* = 0. A phase transition takes place between the paramagnetic state at large energy and a magnetic 
state at small energy. Moreover, as the paramagnetic state is possible only for positive energies e (see Figs. |2"5)) , the 
transition line is located in the domain e > 0. In this region, the quantity &f n s{e, 0) = —(1 + l/(2er)) is negative, 
which ensures that, for any value of e and K, the paramagnetic state m* — is a local entropy maximum. The 
latter argument allows us to exclude a second order phase transition at a positive critical energy, since the second 
derivative 9 T 2 n s(e, 0) would have to vanish, which is impossible. The above argument leads to the conclusion that the 
phase transition must be first order. 

Let us now focus on the behavior of the entropy in the region R3, where the accessible range for m is the union of 
two disconnected intervals [0, m_] U [m + , 1]. As discussed above, the entropy s(e, m) has a local maximum in the first 
interval [0,m_] located at m* = and associated with the entropy sj„ ax = s(e, 0) = log(e)/2. In the second interval 
[m + , 1], a maximum is also present with s^ ax = s(e,m*) where m* > m + > 0. As si iax (e) diverges to — oo when e 
tends to 0, a magnetized state is expected on the line e = 0, as long as s^ ax remains finite. Since K = 2 is the only 
value for which s^ ax (0, K) diverges, the first order transition line originates at the point -8(0, 2) in Fig. [351 Although 
it is possible to study analytically the asymptotic behavior of the transition line near this point, we can rather easily 
compute numerically the location of the first order transition line, represented by the dash-dotted line in Fig. 1281 

Figure |2"51 also shows the metastability line (the dotted line starting at point .4(1/4,1)), for the magnetized state 
to* 7^ 0. To the right of this metastability line, there is no metastable state (local entropy maximum for any m > 0, see 
bottom-right inset in Fig. I28[) while a metastable state (local maximum) exists at some non vanishing magnetization 
on the other side (see top-right inset in Fig. [28)) . Finally, the vertical dotted line e = corresponds to the metastability 
line of the paramagnetic state m* = 0. 

One of the key issues we would like to address is the possible links between the breakdown of connectivity, and 
thus ergodicity breaking, on the one hand, and the phase transition, on the other hand. Obvious general properties 
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FIG. 28: Phase diagram of the mean field model (|223[) with J = — 1. The dash-dotted curve corresponds to the first order 
phase transition line, issued from the point B(0,2). As in Fig. 1261 the solid curve indicates the right border of the region R3, 
where the space of thermodynamic parameters is disconnected. The dashed line corresponds to the K = 2 — 4e. The dotted 
line issued from the point ^4(1/4, 1) represents the metastability line for the magnetized state, while the e = vertical dotted 
line is also the metastability line for the paramagnetic state. The four insets represent the entropy s versus the magnetization 
m for the four energies: e =-0.05, 0.005, 0.1 and 0.35, when K = 3. 



do exist: a region of parameters where the space is disconnected corresponds to a region where metastable states do 
exist. Let us justify this statement. At the boundary of any connected domain, when the order parameter is close to 
its boundary value m&, there is a single accessible state. In a model with continuous variables, like the one we discuss 
here, this leads to a divergence of the entropy. In this case the singularity of the entropy is proportional to ln(m — rrib) 
(see for instance equation (|229[) V For a model with discrete variables, like an Ising model, the entropy would no more 
reach — oo as m tends to mt,, but would rather take a finite value. However, the singularity would still exist and would 
then be proportional to (m — mj,) ln(m — m^). In both cases, of discrete and continuous variables, at the boundary of 
any connected domain, the derivative of the entropy as a function of the order parameter tends generally to ±oo. As 
a consequence, entropy extrema cannot be located at the boundary. Thus, a local entropy maximum (metastable or 
stable) does exist in a region of parameters where the space is disconnected. 

Hence, there is an entropy maximum (cither local or global) in any connected domain of the space. For instance, 
considering the present model, in Fig. [551 the area R3 is included in the area where metastable states exist (bounded 
by the two dotted lines and the dashed line). In such areas where metastable states exist, one generically expects first 
order phase transitions. Thus the breakdown of phase space connectivity is generically associated to first order phase 
transition, as exemplified by the present study. However, this is not necessary, one may observe metastable states 
without first order phase transitions, or first order phase transitions without connectivity breaking. 

A very interesting question is related to the critical points A and B shown in Fig. As observed in the phase 
diagram, the end point for the line of first order phase transition (point B) corresponds also to a point where the 
boundary of the region where the space is disconnected is not smooth. Similarly, the end point for the line of 
appearance of metastable states (point A) is also a singular point for the boundary of the area where the space is 
disconnected. It is thus possible to propose the conjecture that such a relation is generic, and that it should be 
observed in other systems where both first order phase transitions and phase space ergodicity breaking do occur. 

d. Equilibrium dynamics The feature of disconnected accessible magnetization intervals, which is typical of sys- 
tems with long-range interactions, has profound implications on the dynamics. In particular, starting from an initial 
condition which lies within one of these intervals, local dynamics is unable to move the system to a different accessible 
interval. A macroscopic change would be necessary to carry the system from one interval to the other. Thus the 
ergodicity is br oken in the microcanonical dynamics even at finite N . 

In Ref. [248l |. this point has been demonstrated using the microcanonical Monte-Carlo dynamics suggested by 
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Creutz [HH, see Sec. I VI ATI Here, we use the Hamiltonian dynamics given by the equation of motions 
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We display in Fig. [29] the evolution of the magnetization for two cases, since we have shown above that the gap opens 
up when e decreases. The first case corresponds to the domain Rl, in which the accessible magnetization domain is the 
foil interval [0, 1]. Fig. 129a presents the time evolution of m. The magnetization switches between the paramagnetic 
metastable state m* = and the ferromagnetic stable one m* > 0. This is possible because the number of particles 
is small (N = 20) and, as a consequence, the entropy barrier (see the inset) can be overcome. Considering a system 
with a small number of particles allows to observe flips between local maxima, while such flips would be less frequent 
for larger N values. 
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FIG. 29: Time evolution of the magnetization m (the entropy of the corresponding cases is plotted as an inset). Panel a) 
corresponds to the case e = 0.1 and K = 8, while panel b) to e = 0.0177 and K = 3. In panel b), two different initial conditions 
are plotted simultaneously: the solid line corresponds to m(t = 0) =0.1 while the dashed line to m(t = 0) = 0.98. The dashed 
(resp. dotted) line in panel b) corresponds to the line m = m+ ~ 0.794 (resp. m = m_ ~ 0.192). 



In the other case, we consider a stable m* = state which is disconnected from the metastable one. This makes 
the system unable to switch from one state to the other. Note that this feature is characteristic of the microcanonical 
dynamics, since an algorithm reproducing canonical dynamics would allow the crossing of the forbidden region (by 
moving to higher energy states, which is impossible in the microcanonical ensemble). The result of two different 
numerical simulations is reported in Fig. 129b . One is initialized with a magnetization within [0, m_], while the other 
corresponds to an initial magnetization close to ?n(0) = 1 (i.e. within [m+, 1]). One clearly sees that the dynamics 
is blocked in one of the two possible regions, and not a single jump is visible over a long time span. This is a clear 
evidence of ergodicity breaking. 

5. The mean-field (f> 4 spin model: negative susceptibility 

We here show how to solve the so called mean-field 4 spin model using large deviations. The Hamiltonian of the 
model is the following 
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It is a system of unit mass particles moving on a line. These particles are subjected to a local double-well potential, 
and they interact with each other through a mean-field (infinite range) interaction given by the all-to-all coupling in 
the double sum. 

This model was introduced by Desai and Zwanzig jf 33| . More recently, the canonical solution was obtained [l26l ]. 
showing that the system exhibits a second-order ferromagnetic phase transition at a critical temperature T c ~ 0.264, 
corresponding to a critical en erg y e c = T c /2 ~ 0.132. Later, the entropy of the c/> 4 model in presence of an extra 
magnetic field was computed [781 ] . Finally, a calculation of the entropy, as a function of energy and magnetization 
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has been performed in Ref. [79|, Il70l . Il7l| . We note that in this model the modulus of this quantity is not bounded 
inside the interval [—1,1]. 

We will perform the three steps of the large deviation method, in order to compute the microcanonical entropy as 
a function of the energy e and of the magnetization to; i.e., we will compute the function s(s,m) defined in Eq. (195p . 
More details can be found in Ref. (79j . 

The first step consists in the identification of the global variables in terms of which we can express the energy e. In 
addition to magnetization other global variables are u, twice the average kinetic energy, and 
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related to the local potential. We easily see that 
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The second step begins with the computation of the partition function 
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Also in this case the calculation splits into the one of the kinetic part and of the potential part, and we have 
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where the two terms on the right hand side define Z U (X U ) and Z z m (X z , X m ), respectively. The existence of the integral 
requires A u > and X z > 0. From the previous expression, we find 
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Both these functions are analytic, and therefore we can write 

s(u, z, m) = s kin (u) + s conf (z, to) . 



where 



Skin(u) = inf [X u u - <j) u (X u )] =\ + \ ln(27r) + - lnu 
if(z,m) = ^inf [X z z + X m m - cf> Ztm (X z ,X m )] . 
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The third step of the procedure gives the entropy function 



s(e,m) = sup s kin (u) + s conf (z,m) - + z - — = e 

u.z ^ 4 

that can be rewritten as 

s(e, to) = i + i ln(47r) + sup i In - z + ^-^j + s conf (z, to) . 
This variational problem can be solved numerically and the result is plot in Fig. [30] for different values of e. 
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FIG. 30: (a) Entropy s as a function of magnetization m for different values of the energy e. (b) 3D view of s(e,m). The black 
dots show the location of the equilibrium values m* in the microcanonical ensemble in which the magnetization constraint is 
released, and therefore the equilibrium at a given e is realized for the m* value maximizing s(e,m). 



The purpose of this subsection, besides the possibility to offer a further implementation of the large deviation 
method, is to show an example of ensemble inequivalence which is not related to the presence of a negative specific 
heat in the microcanonical ensemble, but rather to the negativity of another quantity that in the canonical ensemble 
is positive definite, namely the magnetic susceptibility at constant temperature (see Sec II V C~3|) . Although striking, 
this feature appears for other long-range systems whose entropy depends on two macroscopic varia bles. For instance, 
it has been recently discussed in connection with models of two-dimensional geophysical flows [308| , for which entropy 
depends on total circulation and energy. 

The inequivalence has to be considered with respect to the canonical ensemble, whose partition function is 
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From this we derive the rescaled free energy 



lim — In Z(f3, h) = inf [fie — [3hm — s(e, m)l 
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It is straightforward to apply to this case expression (|195p . obtaining the free energy in the framework of the large 
deviation method. In the concrete case of the <f> 4 model wc have 



/3f(/3,h) = inf \(3e(u, z, to) — f3hm — s(u, z, m)] 



(248) 



Our presentation in subsection IIV C 31 already contains all we need to study magnetic susceptibility. We repeat here 
the basic formula in Eq. (|125p and apply it to the <fi 4 model. 
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which is valid for both the canonical and the microcanonical ensemble. It turns out that s ££ is always negative, and 
therefore the sign of the susceptibility is related to the sign of s mm . We see that, whenever this quantity is positive 
(something that cannot happen in the canonical ensemble), the susceptibility is negative, and, by continuity, it will 
be negative also in regions where s mm is negative. 

In the left picture of Fig. [30] we plot s(e, to) as a function of to for different values of e, clearly showing the regions 
of convexity of s. The right picture offers a 3D view of s(e,m). The plot of s(e,m) versus m has a bimodal shape, 
and a region of convexity, for e < e c . 

Just for the sake of clarity and readability, we can repeat an argument analogous to the one presented in subsec- 
tion [|VX3l but now with the magnetization to playing the role there played by the energy e. In that case a region of 
convexity in the microcanonical s(e) was associated to the presence of a first order phase transition in the canonical 
ensemble, with an energy jump at a given temperature, without any phase transition in the microcanonical ensemble. 
Now, a region of convexity in s(e, m), seen as a function of m, is associated again with a first order transition in 
the canonical ensemble. The transition is present at temperatures lower than the critical temperature T c , and it is 
expressed by the jump from a positive equilibrium magnetization to a negative equilibrium magnetization, when the 
magnetic field h is varied from + to - . Since the Hamiltonian of this model has a symmetry for q t — > —qi, the 
critical magnetic field, in the canonical ensemble, is h c = 0. Again, no phase transition is present in the microcanonical 
ensemble. The analogy between the two cases is not surprising, since they are both related to the non concavity of 
the entropy function, as seen with respect to two different variables. 



6. The Colson-Bonifacio model for the Free Electron Laser 

In this subsection we discuss an extremely simplified model of the Free Electron Laser (FEL) [54|, Il20| , which can 
be explicitly solved using large deviations [371. [391 ] . It should be remarked that a model of this kind was historically 
introduced by Zaslavsky and coworkers 320] to describe structural phase transitions in crystals. 

In the linear FEL, a relativistic electron beam propagates through a spatially periodic magnetic field, interacting 
with the co-propagating electromagnetic wave. Lasing occurs when the electrons bunch in a subluminar beat wave [52| . 
Scaling away the time dependence of the phenomenon and introducing appropriate variables, it is possible to catch 
the essence of the asymptotic state by studying the classical Hamiltonian 

N 2 N 

h n = y,y~ N5A2 + 2A ^2 sin (^ - v)- ( 25 °) 

The p^s are related to the energies relative to the center of mass of the TV electrons and the conjugated variables Qi 
characterize their positions with respect to the co-propagating wave. The complex electromagnetic field variable, 
A = Ae llp , defines the amplitude and the phase of the dominating mode (A and A* are conjugate variables). The 
parameter 5 measures the average deviation from the resonance condition. In addition to the energy the total 
momentum P = J2j Pj + NA 2 is also a conserved quantity. Most of the studies of this model have concentrated on the 
numerical solution of Hamiltonian (|250|) , starting from initial states with a small field amplitude A and the electrons 
uniformly distributed with a small kinetic energy. Then, the growth of the field has been observed and its asymptotic 
value determined from the numerics. Our study below allows to find the asymptotic value of the field analytically. 
In the first step, similarly to the HMF case, Hamiltonian (|250[) can be rewritten as 
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NH(fx) = TV Q - 5 A 2 + 2A (-m x sin ip + m y cos ipfj (251) 



where m x , my, are defined in Eq. (|127j) and u just below Eq. (|207j) . together with v. Defining the phase of the mean 
field ip' as m x + im y — mexp (itp'), the global variables are [i = (u, v, to, ip 1 , A, ip). 

Going to step two we should first remark that the two field variables A, ip, give a contribution of order 1/N to the 
entropy, which can be neglected. Hence, the function reduces to the one of the HMF model, see formula (|212p . 
Analogously, one obtains the same contributions to the kinetic and configurational entropies of the HMF model, shown 
in formulas ([2"T5]) and ([2"To| . 

Performing finally the third step, after defining the total momentum density as a = P/N , the microcanonical 
variational problem to be solved is 



s(e, a, S) = sup 



iln (u-v 2 ) + s conf (m) 



e = - + 2Am sin (<p' - <p) - 5 A 2 , a = v + A 2 
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Using the constraints of the variational problem, one can express u and v as functions of the other variables, 
obtaining the following form of the entropy 



s(e, <7, 6) = sup 
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The extremalization over the variables tp and ip' is straightforward, since by direct inspection of formula (|253l) . it is 
clear that the entropy is maximized when tp' — <p = —n/2. Then 
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The non zero a case can be reduced to the vanishing a problem using the identity s(s, a, 6) = s(s — cr 2 /2, 0, 6 — a). 
From now on, we will discuss only the zero momentum case since we can absorb non zero momenta into the definition 
of e and S. This has also a practical interest, because it is the experimentally relevant initial condition. 
The conditions for having a local stationary point are 
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where Bi nv is defined in formula ()220|) . It is clear that to = A = is a solution of conditions (|255| and (|256() : it 
exists only for positive e. We will limit ourselves to study its stability. It must be remarked that this is the typical 
initial condition studied experimentally in the FEL: it corresponds to having a beat wave with zero amplitude and 
the electrons uniformly distributed. The lasing phenomenon is revealed by an exponential growth of both A and the- 
electron bunching parameter m. 

The second order derivatives of the entropy s(A, to), computed on this solution, are 



d 2 § 



(0,0) 



,-(o,o) 



dA 2 ' e ' dm 2 

The two eigenvalues of the Hessian are the solutions of the equation 
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The stationary point is a maximum if the roots of this equation are both negative. This implies that their sum 
(—2 + 6/e) is negative and their product (—26/e — 1/e 2 ) is positive. Recalling that we restrict to positive e values, 
the condition for the sum to be negative is e > 6/2 and the one for the product to be positive is e > —1/(26) with 
6 < 0. The second condition is more restrictive, hence the only region where the solution to = A = exists and is 
stable is e > —1/(26) with 6 < 0. When crossing the line e = —1/(26) (6 < 0), a non zero bunching solution (to ^ 0) 
originates continuously from the zero bunching one, producing a s econd ord er phase transition. This analysis fully 
coincides with the one performed in the canonical ensemble in Ref . [1551 . Il56| . The maximum entropy solution in the 
region complementary to the one where the zero bunching solution is stable can be obtained by solving numerically 
Eqs. (|255p and (]256|) . This corresponds to having a non zero field intensity and bunching. 

The equilibrium solution of the Colson-Bonifacio model has been shown to describe well the system only in certain 
energy ranges [33j. Dynamical effects must be carefully taken into account. The dynamics can be studied in the 
N — > oo limit by a Vla sov equation, see Sec[V] The homogeneous zero field state is linearly Vlasov stable for 
6 > 6 C = (27/4) 1 / 3 [l5Cl| . hence, in this region of parameters one cannot expect that the system will relax to the 
equilibrium state. Moreover, as we will discuss, Vlasov equation can display an infinity of quasi-stationary states, and 
the system can be trapped in such states for a time that di verg es with N. The value of bunching parameter and field 
intensity can differ from the equilibrium ones significantly |39j . 



F. The origin of singularities of thermodynamic functions 



We have seen that phase transitions have a paramount importance in the study of ensemble equivalence and in- 
equivalence. Indeed, we have emphasized that inequivalence occurs when phase transitions have different properties 
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in the microcanonical and in the canonical ensemble. The BEG model has shown many aspects of this (see Sec. IIV Bl : 
the microcanonical transition lines do not coincide with the canonical ones; there is a region in the space of thermody- 
namic parameters where there is a first order phase transition in the canonical ensemble while the transition is second 
order in the microcanonical ensemble, which moreover presents a negative specific heat. There is another region where 
both ensembles have a first order phase transition, but with different properties, i.e. an energy jump in the canonical 
ensemble and a temperature jump (with sometimes also a negative specific heat) in the microcanonical ensemble. The 
</> 4 model has shown another kind of inequivalence: instead of being associated to the thermodynamically conjugated 
pair (e, T), we have found that it is related to the pair (m, h). 

Obviously, phase transitions are present also in short-range systems, but in that case ensembles are equivalent. Wc 
remind that, as emphasized in subsection llV A 2l we adopt here the physical point of view about ensemble equiv alenc e 
at first-order phase transitions for short-range systems. At variance with the mathematical physics literature [300| |. 
we do not use the term partial equivalence in cases where there is not a one-to-one correspondence between energy in 
the microcanonical ensemble and temperature in the canonical ensemble. 

Then, the questions may arise about a possible qualitative difference between phase transitions in short-range 
and in long-range systems. This section is devoted to a brief discussion of some results that concern this problem. 
We will give only few details, for two reasons: first, this subject lies outside the purpose of this review; second, 
two recent reviews a nd a book have appeared dealing with this problem where the interested reader can find more 
details [H], 1 1071 . l26l| . We will discuss the origin of the analytical properties of entropy and of free energy that, in 
turn, are associated to the presence of phase transitions. 

The c/> 4 model can be studied without the magnetization constraint in the microcanonical ensemble, and without 
the presence of an external magnetic field in the canonical ensemble. This is what has been done in Ref. jl26| . where, 
as we have mentioned in subsection IIVE 5[ a second order phase transition has been found. According to what we 
have emphasized at the end of subsection IIV A 3l a second order phase transition implies that the two ensembles are 
equivalent. Nevertheless, the presence of the transition means that the second derivatives of s(e) and of 4>(P) are 
discontinuous, the first at the transition energy s c and the second at the corresponding transition inverse temperature 
f3 c . The non analyticity of the free energy is the usual signature of phase transitions. What we are interested in here 
is the origin of this non analyticities. 

The entropy function s(e, m) of the model, previously studied, is an analytic function in both arguments. We 
have also seen that the inequivalence with the canonical ensemble, where a first order phase transition is present, is 
due to the concavity properties of s(e, m). If we study the system without the magnetization constraint, we can use, 
as before, the large deviation method, with the only difference that at the third step of the procedure, we maximize 
in Eq. (|245|) also with respect to m. This is simply because the entropy s(e) is, for each e, the maximum over m 
of s(e,m). Since there is a point of non analyticity in s(e), at the energy e c of the second order phase transition, 
this means that the process of maximization of the analytic function s(e,m) leads to a non analytic function s(e). 
This is due to the bimodal shape of s(e, m), see Fig. [301 This bimodal shape would not be possible in a short-range 
system, where s(e, m) has to be concave with respect to m at each value e. This argu ment has been recently used by 
Kastner a nd collaborator s [l97l | , with particular emphasis on the <ft 4 model jl7Ct Il7l| and on the mean- field spherical 
model fl96l . [l98l l200l | . 

In subsection IIV A II we have emphasized only the concavity with respect to e. However, in sh ort-range systems 
the entropy is concave also with respect to other macroscopic variables, e.g. magnetization |l6l| . Of course, there 
might be ranges where the concavity is not strict, associated to first order phase transitions. This means that, for 
short-range systems, a possible non analyticity in s(e) could not derive from the maximization of an analytic s(e,m), 
but that already the function s(e, m), over which the maximization is performed, is non analytic. The mechanism that 
can give r ise to non analyticities of the entropy in short-range s ystems has bee n emphasized by Pettini and coworkers 
(see [H|, l26l| and references therein). These authors prove jl57l Il58l . Il59l | that a necessary condition for having 
a phase transition in short-range system is the presence of topological changes in the configurational space subset 
M(v) = ({qi}\U({qi})/N < v), which is nothing but the level set v of the potential energy U. This topological change, 
according to Morse theory, is signalled by the presence of "critical points" in M(v), i.e. points where the differential 
of U vanishes. This idea that phase transitions have an origin in some topological and analytical properties of the 
phase space could of course be extended to systems with long-range interactions. Indeed, it has been extended, and 
detailed calculations of the critical points of the HMF model [8l], Ell and of the fc-trigonomctric model 0, H Q have 
been performed, allowing to show analytically that a topological invariant (the Euler characteristic) has a jump at the 
phase transition point. However, results in this field raise appare nt parado xes. Indeed, by considering the mean-field 
4> 4 model with varying strength of the mean-field term, Kastner [l7dl . Il7l| prove that nothing special appears in the 
analytical and topological properties of the potential energy at the second-order phase transition point. Hence, the 
mechanism here at work to create a phase transition must be different and it is natural to think that it is indeed 
related to the extremalization of a thermodynamic analytic function (entropy or free energy), independently of the 
presence of critical points in the potential. 
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However, although phase transitions in long-range systems, as just explained, can derive also from a mechanism 
that is not present in short-range systems, still they could also derive from topological changes. The problem to solve 
is to select which of the "critical points" , whose number grows with N, will turn out to determine the few non analytic 
points of the thermodynamic functions. Recently, it has b een s hown that the non analyticities due to "critical points" 
can become infi nitely wea k in the thermodynamic limit [200j ] (solving also a contradiction arisen in a study of the 
spherical model |272l l273l |). i.e., the order of the derivative of the entropy which is discontinuous becomes larger and 
larger when N grows. In order to explain how a disco ntinuity in a first or second order derivative of the entropy 
can survive in the thermodynamic limit, Kastner [X09t | introduces a criterion of "sufficient flatness" of the critical 
point. An application to the HMF and to the k- trigonometric model confirms this criterion, selecting the correct 
phase transition point in the thermodynamic limit [l99j. 

All these results concern necessary conditions and the much harder problem of finding sufficient conditions has not 
yet been solved. For instance, for a generalization of the /c-trigonometric model, it has been found @ that a phase 
transition occurs at an energy where no topological change of the phase space manifold is present. 



V. OUT-OF-EQUILIBRIUM DYNAMICS: QUASI-STATIONARY STATES 



In this section, we will give a brief introduction to the kinetic equations used for long-range systems, i.e. Klimon- 
tovich, Vlasov and Lenard-Balescu equations. We will present the numerical evidence of the existence of quasi- 
stationary states in the iV-particle dynamics of the Hamiltonian Mean Field (HMF) model. These states, that are 
out-of-cquilibrium and have a lifetime that increases with system size N, turn out to be related to stable station- 
ary states of the Vlasov equation. Selecting the dynamics of a single particle and treating the rest as a bath, we 
will find that the one-particle velocity distribution function obeys a Fokkcr-Planck equation with nonlinear diffusion 
coefficient. Solving this equation, we find cases where diffusion can be normal or otherwise anomalous. A theory, 
introduced by Lynden-Bcll for gravitational systems, which relies on a "fcrmionic" entropy, allows us finally to predict 
the one-particle distribution function in the quasi-stationary states for a specific class of initial conditions. 



A. Kinetic equations 



1. Klimontovich, Vlasov and Lenard-Balescu equations 



Here, we will discuss the equations that describe the evolution of the one-particle distribution function for systems 
with long-range interactions. 



The first kinetic equation for a Hamiltonian TV-body system was derived by Boltzmann 51|. Boltzmann's theory, 
in which particles interact only through binary collisions, describes diluted gases with short-range interac tion s. In 
order to take into account collisions determined by long-range Coulomb and gravitational forces, Landau [214| and 
Chan drasckhar (90j modified Boltzmann's collisional term. Mean-field co llecti ve effects were first considered by Vlasov 
[3101 ] and led to the Vlasov equation, later refined by Landau himself 215 [. A tr eatm ent of col lision terms in the 
context of the Vlasov-Landau approach was developed by Lenard and Balescu ptl |222| | (see Ref. [l53l | for a review): 
collective effects were effectively taken into account by a dielectric function. These theoretical approach es have also 
led to the development of kinetic theories for point vort ices in two-dimensional hydrodynamics [94 , Il34| and for non 
neutral plasmas confined by a magnetic field [1351 . Il36j . A detailed recent discussion of the derivation of Landau, 
Vlasov and Lenard-Balescu equations and of the approximations involved can be found in Ref. [95| . 

The relation between the jV-partic le dynamics and the soluti on o f the Vlasov equation has been studied for mean- 
field systems by Neunzert [25ll . |252| and Braun and Hepp [66l l289j |. The latter proved that, for sufficiently smooth 
potentials, the distance between two initially close solutions of the Vlasov equation increases at most exponentially 
in time. If we apply this result to a large N particle approximation of a continuous distribution the error at t = is 
typically of order 1/y/N, thus for any e and any particle number N, there is a time t up to which the dynamics of 
the original Hamiltonian and its Vlasov description coincide within an error bounded by e. The theorem implies that 
this time t increases at least as In N. 

For what concerns the methods to derive the Vlasov equation and higher order kinetic equations, two main per- 
turbativc approaches h ave b een followed. The first one begins with t he L i ouvi lle equation and, cither by the usual 
BBGKY expansion [28|, Il83l | or by some projector operator formalism [l93l l312j leads to the kinetic equation for the 
one-particle distribution fu nction making some particle decorrelation hypothesis. The second approach is based on 
the Klimontovich equation |209l . 12271 . |253| . In both approaches, the small parameter of the perturbative expansion 
turns out to be 1/N, where N is the number of particles. These two methods give completely equivalent results. In 
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the following, wc will discuss the method based on the Klimontovich equation. The Vlasov equation will come out at 
leading order, while the Lenard-Balescu correction term will appear at order 1/N . 



2. Derivation of Klimontovich equation 
The general Hamiltonian we will discuss is of the form 

N p2 

H N = Y J ^ + U({Qj}), (259) 

i=i 

where we have used the coordinate ®j for particle j because, in the following, we will mainly treat models where 
positions are specified by an angle and 

N 

u(o 1 ,...,Q N ) = Y,v(e i -e j ). (260) 

We do not use here Kac's scaling of the potential (which amounts here to put a 1/N prefactor in the above formula), 
because we want to maintain the derivation at the general level, which includes both short and long-range interactions. 
When relevant, we will make comments on where Kac's scaling would produce effects for long-range interactions. 
The state of the ./V-particlc system can be described by the discrete one-particle time-dependent density function 



i N 

U (6,p, t) = - y £s(e-Q J (t)) 6 (p - Pi (t)) , (261) 
i=i 

where 5 is the Dirac function, (0, p) the Eulerian coordinates of the phase space and (0j , Pi) the Lagrangian coordinates 
of the TV-particles whose dynamics is given by the 2N equations of motions 



9, = Pj, (262) 
dU 



Pj = (263) 



'j 

Differentiating with respect to time the one-particle density (|261[) and using Eqs. (|262p and (|263[) . one gets 

dJ4 W A = -JfT, p 4e 5 {e ^ (t)) 5 {p ~ p i {t)) + ^T^lL He -^ {t)) 5 (p ~ P > {t)) ' (264) 

j 3 3 

Taking advantage of the property of the Dirac (5-function, a8(a — b) = bS(a — b), it is possible to rewrite this equation 



as 



= -^pp (o Q i (*)) 5 (p - p i (*)) + E %iL 5 V ~ ^ (*)) 6 (p ~ p i (*)) ■ ( 265 ) 

3 j 

where 

v(6,t)=N J de'dp'V(6-6')f d (6',p',t) , (266) 
which leads to the Klimontovich equation 



dfd . df d dvdf d 

^r +p ^e-d9^p- = - (267) 

One notes that the above derivation is exact even for a finite number of particles N. However, this equation contains 
the information about the orbit of every single particle (since /<j depends on the 2N Lagrangian coordinates of each 
particle, (Oj,P,)) which is far more than we need. Hence, Klimontovich equation is especially useful as a starting 
point for the derivation of approximate equations that describe the average properties of the system. This is what we 
discuss in the following subsections. 
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The more widely used approach to kinetic equations begins with the Liouvillc equation. This equation governs the 
time evolution of the probability density in the full 2N dimensional phase space (9i,p%, . . . , 0n,Pn)- To arrive at the 
Liouville equation, let us first define the Klimontovich distribution function 

N 

F d (9 1 , Pl ,e 2 , P2 , . . . ,9 N ,p N ,t) = H 5 (9 t - Qj (t)) 5 { Pj - P 3 (t)) , (268) 

3=1 

where (0,(i), Pi(t)) are again the Lagrangian coordinates. Deriving (|268| with respect to time, and using the equations 
of motion, one obtains 

dF d " dF d ^ dU{9 u ...,9 N ) dF d 

■aT + E» AST - E w % = °" (269) 

1= 1 I— 1 

This is the Liouville equation for F d . It has been derived for a particular distribution function, which describes a 
single point in the 2N dimensional space. The Liouville equation for a generic distribution function is obtained by 
an averaging procedure which leads to the introduction of a smooth density p(6\,pi, 9 2 ,P2, ■ ■ ■ , 9n,Pn, t), which can 
be shown to obey the same equation (|269|) . This smoothing procedure is similar to the one that leads to the Vlasov 
equation, that we will describe just below. However, while the Liouville equation for p is exact (as the Klimontovich 
equation for f d ), the Vlasov equation is not exact and we will indeed discuss finite N corrections. It is well known 
that the fluid described by the density p is incompressible. 



3. Vlasov equation: collisionless approximation of the Klimontovich equation 

Determining f d (9,p, t), which characterizes whether a point particle is to be found at a given point (9,p) in Eulcrian 
phase space, would imply to solve the equations of motion (|262I263[) with initial conditions ({O,(0), Pj(0)}). In general, 
this is a difficult task and typically not feasible for nonlinear systems. Alternatively, one can define an averaged one- 
particle density function using an infinite number of realizations prepared according to some prescription. One could 
for instance consider a large number of initial conditions, close to the same macroscopic state. Let's define the density 
of such initial macroscopic state as /m({©i(0), P,(0)}). The average one-particle density function fo is obtained as 

fo(9,p,t) ee (f d (9,p,t)) = J 11 dO^dP^O)/^©^),^)})/^,^ , (270) 

i 

where the dependence of f d on ({©;(0), p(0)}) comes from the solution of the equations of motion that enter the 
definition of f d , see Eq. (|261[) . 

The equation for the time evolution of the smoothed distribution fo is obtained by again averaging over fi n . Before 
doing it, let us introduce the definition of the fluctuations 8f around the smooth distribution 

fd(0,p,t) = f (9,p,t) + -^=6f(9,p,t). (271) 
v N 

Evidently /o is independent of the detailed microscopic properties of the initial state and depends on it only through 
fi n , while 6f, like f d depends on all Lagrangian variables of the initial state. Obviously, the average over fi„ of Sf 
is zero. The introduction of the prefactor l/V~N takes into account the typical size of relative fluctuations. Indeed, 
fd ~ fo is the difference between a singular distribution, containing Dirac deltas, and a smooth function; therefore the 
statement that this difference is of the order 1/VN has to be interpreted physically. Its meaning is the following: if 
we integrate f d — fo in 9 and p in a volume which is small compared to the total available volume, but large enough 
to contain many particles, then the value of this integral, both at equilibrium and out-of-equilibrium, is of the order 

i/Vn. 

Inserting Eq. (|27Tj) into Eq. (|266|) leads to 

v(6,t) = (v)(6,t) + -^=5v(9,t), (272) 
where the first term comes from the average over and coincides with 



(v)(9,t)=N / d9'dp'V(9-9')f (9',p\t) , 



(273) 
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while the second term defines Sv, which depends on all the details of the initial state. If we had used Kac's scaling the 
TV prefactor in formula (|273[) would be absent, making the average potential intensive. This latter scaling is commonly 
used [289j | and would be more appropriate to derive the Boltzmann equation. 

Inserting both expressions (|27ip and (|272[) in the Klimontovich equation (|267[) . one obtains 

dfo df d(v) df _ i fasf asf d8vdf a{v)asf\ idSvdSf 

dt P 09 dd dp VTV V dt P 89 89 dp 89 dp J TV 89 dp ' 1 ' 

Taking the average over /j„ of this equation leads to 

dfo 8fo_8( 1 vldfo ]_/86vd6l\ 

dt P 89 89 dp N\d9 dp / 1 ' 

It is important to stress that above equation is still exact. Up to now we have not even made any hypothesis about 
the long or short-range properties of the potential V. In standard k inet ic theory, Eq. (|275|1 would correspond to the 
first equation of the BBGKY hierarchy (see e.g. Chapter 4 of Ref. |253| ). For short-range interactions, the r.h.s. of 
this equation would originate the leading contribution to the collision term of the Boltzmann equation, while the third 
term of the l.h.s. would be negli gible close to equilibrium. On the contrary, for long-range interactions, it will turn 
out that the r.h.s is of order 1/TV |26ll2l3 . l222| . Indeed, the scaling with 1/VN in Eqs. (j27l"j) and ([272)) is appropriate 
for long-range interactions. Therefore, for long-range interactions, the third term on l.h.s. will be the leading term 
and, in the limit TV — > oo, one ends up with the Vlasov equation 

dfo dfo d(v) df Q 

^T +P ^9-^9-^p- = °- (276) 

To distinguish the effects due to discreteness (finite value of TV) from the collective effects grouped on the l.h.s. of 
Eq. (|275[) . the r.h.s. is usually referred to as collisional term. This is the origin of the name collisionless Boltzmann 
equation sometimes used for the Vlasov equation. It is however important to underline that there are no true collisions 
for long-range systems: granular effects, discreteness effects or finite TV corrections would be more ap prop riate names. 

The Vlasov equation has wide applications in gravitational systems [47[ and in plasma physics [2531 ]. A typical 
question that is posed is the one of stability of the stationary solutions. In particular, the stability of stationary 
homogeneous solutions has important applications in plasma physics for the description of Landau damping of for 
beam-plasma instabilities. In the following subsection, we will discuss in some detail the stability of stationary 
homogenous solution for the Hamiltonian Mean Field model. 



4- Stationary stable solutions of the Vlasov equation: application to the HMF model 



Let us concentrate on homogenous states, for which the one-particle distribution function does not depend on 9, so 
that fo = fo(p,t). For these distributions, using Eq. (|273p . one immediately gets that (v) = const., and consequently 
the Vlasov equation (|276|) reduces to 



f(p, f ) = 0. 



(277) 



Therefore, homogeneous distributions are stationary. However, stationarity does not imply stability. Indeed, the 
stability of stationary spatially homogeneous solutions can be studied by subtracting Eq. ()275|) from Eq. (|274jl . One 

gets 



dSf 85 f 8Sv dfp 8(v) dSf _ 1 
~~dt +P ~dT ~ ~d9~dp ~ ~df~dp ~ VTV 



d5v 86 f I dSv dSf 



d9 dp 



d9 dp 



(278) 



For times much shorter than VTV (or equivalently for TV — * oo), we may drop the r.h.s. of Eq. (|278|) , which contains 
quadratic terms in the fluctuations. As the last term of the l.h.s. v anishes, since (v) = const., the fluctuating part of 
fd, Sf, obeys the linearized Vlasov equation [66L 12151 . l24ll . I289L l310l | 



d6£ 9S£ _ dSvdfo = () 
dt P d9 d9 dp 



(279) 
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This equation could also be easily obtained by linearizing directly the Vlasov equation (|276|) . Looking for plane wave 
solutions 

5f(0,p,t) = f(p) jW-"*) (280) 
5v(6,t) = ie l ( M_wi ), (281) 

we obtain 

- ilo f(p) +pik f( P ) - ik Af (p) = (282) 

which leads to 

f(p) = 4^-foiP)- ( 283 ) 
pk — to 

This analysis is completely general and applies to all one-dimensional models. 

To be more specific, let us consider the Hamiltonian Mean Field model (|126p . In this particular case, the potential 
appearing in the Vlasov equation Eq. (|276p is 

p2ir />+oo 

(v)(0,t)= da dp[l-cos(6-a)]f (a,p,t). (284) 



We now use Eqs. (|280I281I283[) . where for the HMF model k is an integer because the potential is 27r-periodic, in the 
definition of the fluctuations of the potential 

6v(0,t)= da dp [l-cos(6-a)]Sf(a,p,t) . (285) 

JO J -oc 

We obtain 

Sv(6,t)=Ae l ( kd - ujt ) = -^fc(4i+4-i) r^dp^^Ae^-^. (286) 

J-oo Pk - oj 

Introducing the so-called "plasma response dielectric function" (see Chapter 6 of Ref. [2531 ] ) 

D(w,fc) = l + 7rfc(<J M +(Sfe,-i) / dp 4^^, (287) 



pk — uj 

Eq. (|286[) can be rewritten as D(ui, k)A cxp(i(k9 — u>t)) = 0. In order to get non vanishing solutions for A, the relation 

D{uj,k)=Q (288) 

must be satisfied, which leads to the dispersion relation to = uj(k), linking the frequency with the wavevector. From 
the definition (|287|) . it is evident that the only possible collective modes arc k = ±1. The denominator in the integral 
on the r.h.s of Eq. (|287[) must be treated with care. Indeed, a difficulty arises if one attempts to obtain solutions oj(k) 
of Eq. (|288|) corresponding to purely oscillatory solutions of the linearized Vlasov equation (|279[) . i.e. with lm(cj)=0. 
These solutions are important because they lie at the boundary between stable and unstable modes, which correspond 
to negative and positive values of Im(w), respectively. If one makes the substitution of real values of u> into the integral 
mentioned above, one notes that the integrand has a pole at p = oj/k and therefore the integral is not well defined. 
The prescription for performing this singular integral is to deform integration contour of p in such a way to circulate 
around the pole p = oj/k. This method was introduced in plasma physics by Landau [215| and the deformed contour 
is called Landau contour. The deformation of the contour is equivalent to the displacement of the pole lu — > lu ± iO. 
More rigorously this is expressed by the Plemclj formula 

lim 5—— = V — - — T inS(x - a) (289) 

7^0 x ~ a ± x — a 

where V denotes the principal value, which applies when integrating over x. The second term on the r.h.s comes 
from integrating on a semi-circle around the pole. The direction to be chosen for the displacement of the pole is 
determined if one uses the Laplace transform in time, suited for the solution of the initial value problem associated to 
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our kinetic equation. We will use this technique in the next subsection, treating the Lenard-Balcscu equation. The 
Laplace transform is defined for Im(cj) > 0. From this, we derive that D(u>, k) is defined by Eq. (|287p for Im(w) > 0, 
and by its analytic continuation for Im(w) < 0. In particular, this means that for Im(aj) = the rule for encircling 
the singularity at p = cu/k is obtained by the Plcmelj formula (|289p by posing u> — > u) + iO. Applying this rule, we 
find that for k = ±1 the dielectric function for real values of u> is given by 



D(u,±l) = l + nV 



-oo PT U 



(290) 



We see in particular that D(—uj, —1) = D*(lo, 1). As we mentioned, the real solutions ui(k) obtained from equating to 
zero the dielectric function correspond to the boundary between stable and unstable modes. From the last equation, 
posing equal to zero both the real and the imaginary parts, we see that in order to have a real w as a solution of 
D(u>, ±1) = 0, it is necessary that /q(±u) = 0. If one considers initial distributions with a single maximum at the 
origin in momentum space, one obtains, for both D(ui, 1) and D(cu, —1), uj = 0. Hence, the real part of Eq. (|290[) will 
be zero if 



1 = 1 



+ 00 



dp = 



(291) 



This gives the stability boundary of the homogeneous distribution fo(p). To deduce where the stability region is 
located in the functional space of fo(p), o ne ca n perform an analysis based on the Nyquist criterion, from which one 



finds that the stability condition is / > [187 



The linear stability analysis discussed above appeared in a series of different papers foil. IlldL lllll . Ill2l . Il87l \262\ , 
The stability boundary can be also derived using the so-called energy-Casimir method |316| . 

Condition (|29ip is a functional equation in fo(p)- By a convenient parametrization of fo(p), one can obtain a 
stability boundary in a finite dimensional control parameter space. A relevant parametrization for the HMF model is 
the one in terms of the energy per particle e. Therefore, we will briefly discuss below some examples where a critical 
energy density e c appears, above which the homogeneous solution is stable. 
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FIG. 31: Three examples of stationary homogenous solutions of the Vlasov equation. The Gaussian (dashed), the water-bag 
(dotted) and the power- law fEq. 1295]) in the case v = 8 (dash-dotted). 



• The first one is the Gaussian distribution 

f S (p) = ^-\[fcM-f3p 2 m (292) 

Z7T V ZTT 

(see Fig. [3Tjl . One gets 1=1 — (3/2. The condition 1 = coincides with the equilibrium statistical mechanic s 
result that the critical inverse temperature is /3| = 2, and its associated critical energy is e c & = 3/4 [Tll . ll86l . ll87( . 

• The second example is the water-bag distribution 

Ub(p) = ^7?-MP + P°)-0(p-Po)} (293) 
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where is the Heavyside function. This distribution, also shown in Fig. [311 has been often used in the past to 
test numerically the out-of-equilibrium properties of the HMF model. In this case, one obtains 1=1 — 1/(2j>q) 
which leads to a smaller critical energy e^ h = 7/12. 

Another example studied in the literature is the q-Gaussian 

/t(p)~ [1 - a(l - q)p 2 ]^ , (294) 

with a positive. This function has a compact support for q < 1, while it decays with a power law tail for q > 1; 
we recover the Gaussian for q = 1. The water-bag distribution is recovered for q — > — oo. For q > 5/3, the 
second moment of this distribution, and thus the average kinetic energy, is not finite, and therefore a cut-off 
to keep the energy finit e is used in actual implementations. For the cases with finite kinetic energy, i.e., for 
q < 5/3, one gets p. [Til ITTol ] = | + 2 (t-3q) ' ® ne easily verifies that the previous values of the critical energy 
for the Gaussian and the water-bag distribution are recovered for the proper values of q. 



The last example [3171 ] is a distribution with power-law tails 

frW-T+BKP <295) 

characterized by the exponent v. The unity, added in the denominator to avoid the divergence at the origin p = 0, 

does not affect neither the asymptotic form, nor the theoretical predictions. The parameter po = g'^Jy^ y 

controls the kinetic energy density K/N and the normalization factor is A v = v sin(7r / V) / '{2npo) . The exponent 
v must be greater than 3 to get a finite kinetic energy: we have used v = 8 (see Fig.l3"Tj). Note that the power-law 



distribution cannot be included in the q-exponcntials family 30 lj . although it has similar power law tails at 

sin(7r/ v) 
4sin(37r/z^) 5 



large \p\. Distribution (|295p is stable above the critical energy density £pi = 5 + 4s'in(3wl) 1 which corresponds 



to £pj = 0.75, 0.625 and 0.60355 ... for v = 4, 6 and 8 respectively. 
• Finally let us consider a more general distribution, namely a mixed distribution between / w b and / g , defined as 

f a ( P ) = (1 - o)/ wb (p) + af s (p). (296) 

Thanks to the linearity of the qua ntity (|29ip with respect to the distribution, the critical energy density for this 
mixed distribution f a is obtained [316l | as a linear combination of the previous results. 

The condition (|291| defines therefore a critical energy e c which is, in general, different from the critical energy 3/4 
where the second-order phase transition of equilibrium statistical mechanics is located. However, as expected, the 
two values coincide for a Gaussian distribution. 

All the above distributions are thus stationary solutions of the Vlasov equation (|276p . However, it is important 
to realize that they are Vlasov stable stationary solutions among infinitely many others and there is no reason to 
emphasize one more than the others. 



5. The Lenard-Balescu equation 



We have so far concentrated on the collisionless dynamics as described by the Vlasov equation. When using this 
equation, one implicitly assumes that the particles of the system move under the influence of the average potential 
generated by all the other particles. This means that the acceleration of all the particles is given by a single function, 
i.e., by —d(v)/d6. However, this assumption is not valid for arbitrarily long times. The presence of individual 
"collisions" invalidate the assumption that the acceleration of all the particles derives from a single mean field function: 
"collisions" perturb particles away from the trajectories they would have taken if the distribution of particles in the 
system were perfectly smooth. In long-range systems, however, we have seen that for times shorter than N (thus 
for times that can be very long), the collisionless approximation is valid. For larger times, the Vlasov equation is 
not valid and we have to consider the effect of "collisions" . Under this effect, particles will deviate from the orbits 
determined by the Vlasov equation on a characteristic time that is called relaxation time. This can be justified from 
the fact that a stable stationary solution of the Vlasov equation, as determined from the analysis described in the 
previous subsection, will eventually be perturbed by the effect of "collisions" . 

Let us then go back to the exact Eq. I|275p and consider the right hand side. At the next level of approximation, 
i.e., the level 1/N, we can determine the right hand side using the solutions for 5v and Sf as determined by the 
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collisionless dynamics, given by Eq. (|279p . If again we restrict to the case in which Jq, a stable stationary solution of 
the Vlasov equation, does not depend on 9, at this level of approximation our kinetic equation will therefore be 



df 1 /dSvdSf\ 



dt N\d9 dp I 



(297) 



This equation is valid up to a time where the right hand side makes /o leave its stability basin as determined by 
the Vlasov equation. As already anticipated, one can expect this time to be of order N. This equation is called the 
Lenard-Balescu equation, and its solution requires the determination of the correlation function on the right-hand-side 
of Eq. (|297[) . To achieve this goal, it is necessary to use the spatio-temporal Fourier-Laplace transform. Using this 
technique, one is able to determine the correlation expressing the collisional term as a function of its value at the 
initial time. Wc thus have to solve an initial value problem. 

The spatio-temporal Fourier-Laplace transform of the fluctuation of the density Sf is defined by 

5f(k,p,u) = J — j dt e-^ 9 -"*) 6f(0,p,t), (298) 

associated with a similar expression for the fluctuation of the potential 6v. As usual with Laplace transforms, 
Sf(k,p,ui) is defined by the last equation only for Im(aj) sufficiently large. For the remaining part of the complex w 
plane, it is defined by an analytic continuation. The inverse transform is 

+oo „ , 

5f(6,p,t)= J2 J ^ 8f(k,p,u,), (299) 

k— — oo 

where the Laplace contour C in the complex ui plane must pass above all poles of the integrand. The inverse transform 
has a sum over the integer values of k since the coordinates 9 take values from to 2n. In fact, we will determine 
the right hand side of Eq. (|297p taking as an example the HMF model, as we have done in the previous subsection. 
More generally, there will be an integral over k, but the analysis will be perfectly equivalent. 

If we multiply Eq. (|279|) by e _ H fee_a; *) and integrate over 9 from to 2n and over t from to oo, wc obtain 

- 5f(k,p,0) - iLu5f(k,p,Lu) +ik P 5f(k lPl Lu) - ik&v{k,uj)f Q {p) = 0, (300) 
where the first term is the spatial Fourier transform of the initial value 

/■27T in 

5f(k,p, 0) = J — e~ M 8 f{0, p, 0) , (301) 

and it arises from the integration by parts in obtaining the Laplace- Fourier transform of dSf /dt. The above equation 
can be rewritten as 

oj{k,p,uj) = —Sv{k,u) + — y, (302) 

where one identifies a first "collective" term depending on the perturbation of the potential, and a second one which 
depends on the initial condition. Combining Eq. (|302[) with the Laplace-Fourier transform of the fluctuations of the 
potential (|285|) . i.e. 

. r+oo 

Sv(k,u) = -7T (4,1 + 4,-1- 2<5 fe ,o) / d P Sf(k,p,uj), (303) 



and integrating (|302p over the p variable, gives 



+oo 



dp 6f(k,p,w) 



+ 00 



1+TTfc (4.1+4.-1) / dp' f ° {p>) 



(jp'k — lj) 



tX 



Sf(k,p,0) 



oo 



i(pk — ui) 



where one recognizes the plasma response dielectric function (|287[) in the parenthesis. Equation (|303[) can thus be 
rewritten as 

Hk M = _*(fr.i+>.-i-2* fc ,o) / + " Sf(k,P,0) 
D(ui,k) J-oo i(pk-u>) 
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We see that the Laplace contour C for the inversion formula must pass above all zeroes of D(k,ui). We can consider 
that these zeros will all be located in the half-plane Im(w) < 0, since otherwi se th e problem of the 1/N perturbations 
to a stable stationary solution of the Vlasov equation would not make sense |227| . 

One can then use these expressions to compute the collisional term appearing on the right-hand-side of Eq. (|297[) . 
Forgetting temporarily the factor 1 /N and the derivative with respect to the variable p, one has 



(306) 



\k— — oo k' — — oq 



E E J duj I du'ikeW+^-^'W (6i(k,u)8fW,p,iS)) , (307) 

k— — oo k' — — oo 

which relies on evaluating the correlation (5v{k,uj) 5 f{k' , p, u/)) . The presence of the factor k in the last equation 
allows us to forget the k = contribution to Sv(k,Lu) in Eq. (|305[) ; this was expected, since the constant component 
of Sv cannot contribute to the force. Using Eq. (|302[) . one finds 

/— \ k'f'tn) i~ \ ( Sv(k, uj) 8f{k',p, 0) ) 

{6 V (k,.)6f(k',p,.>)) = ^( HktU)H w) + \ .^^^ I . (308) 

The first term on the r.h.s. corresponds to the self-correlation of the potential, while the second one to the correlation 
between the fluctuations of the potential and of the distribution at time t = 0. Let us consider separately the two 
terms of the last equation. 

From Eq. (|305[) . neglecting the k = contribution and taking the statistical average, we have 

{Sv(k,u)6v(k ,u)) = ~- — - / dp dp —— -— — 309) 

D(uj,k)D(uj',k') J-oo 7_oo i(pfc-w)i(p'fc'-w') 

tt 8 kl - w + 4,-i) f + °° Ap f +0 ° dp , [h{p)5{p-p') + ^ P y)] 



2 D(u, k)D((j', -k) J-oo J-oo (pk~u;){p'k + u;') 

where we have replaced the autocorrelation of the fluctuation of the distribution at t = (see Appendix D) by 

(Sj(k,p,0W(k',p',0))= S -^[f (p)6(p-p')+fi(k,p,p')] , (311) 

with the first term expressing the single particle contribution to the correlation, while in the second term the function 
fi(k 7 p,p') comes from the contribution of different particles. This last function is smooth, b ut ot herwise arbitrary, 
since it is related to initial conditions of our initial value problem. However, it can be shown [227| that, going back 
in the time domain with the inverse Laplace-Fourier transform, the contribution of this function to the correlation 
decays in time. Therefore we can consider only the first term in Eq. (|311[) . obtaining 

iTii, \Tiv >\\ 77 s k,-k' {Sk.i + 8k.-i) f + °° , fo(p) ,„ 10 -. 

(ov(k, ui)ov(k ,u> )) = — — = ! / dp—- tt- 312 

2 D(uj,k)D(Lu\-k) Loo (pk - cj)(pk + w') V ; 



Considering again only the contributions that, after integration in u> and to', do not decay in time, it can be shown [2271 ] . 
through the use of the Plemclj formula (]289[) . that [(pk — oj)(pk + oj')]~ 1 can be substituted by (2ir) 2 6(uj + uj')8(uj — pk). 
In addition, using the property D{uj, k) = D*(—lu, —k), one finally ends up with the result 

(Sv(k,w)5v(k',oj')} =27r 3 4,_ fe , 5 ^ + U) '\ [ dp f (p)6(u; - pk), (313) 



D(w,k) 

which vanishes except for \k\ = 1: only the cases k = —k' = ±1 therefore contribute. 

We now consider the second term of Eq. ()308j) . Using again Eq. (|305|) without the k = term we have 

(si(k,u)Sf(k\p,0)) ntfki+Si ,1 r+°° (5](k,p',Q)Tf(k',p^)) 

/_ TT {Ok,l "I" Ok,-l) j dp'^— -— -L. (314) 



i(pk'-uj') D{u),k) J-oo i{p'k-uj)i{pk' -uj') 
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As for the analysis of the first term of Eq. (]308[) . we substitute the initial time correlation in the last integral with 
the first term in Eq. (|311[) . and afterwards we substitute [(pk — uj)(pk + U)')]~ l with (2it) 2 6(uj + uj')6(uj — pk). We 
therefore obtain 



6v(k,uj,k) 6f(k',p,0) 
i(k'p — u>') 



2n 2 6 kt _ k , (6 kjl + 
D(u,k) 



fo(p)6(uj + uj')6(uj - pk) . 



(315) 



From Eq. (|3 13[) . one gets the contribution to (|307|) of the first term of Eq. (|308[) . Exploiting the presence of the 
factors 6 k . -w and 6(uj + uj'), this contribution is 



ITT 

T 



+00 „ 

y2 duj (4,1 + 6 k -i) 



k 2 m 



k— — 00 



pk — UJ 



D(uj,k) 



/o(w) 



dp'f ( P ')6(LJ- P 'k) 



/o(-w) 



tX 



dw/5(p) 



p — UJ 



in8(p — uj) 



D(lj,1) 



1 



P + uj 



in5(p + uj) 



/o(-w) 



f +00 

- 2 ' 



i/o(p)/o(w)%-w) ■ 



(316) 

317) 
(318) 



Passing from the second to the third line we have used again the Plemelj formula, that allows us to integrate on 
the real u> axis. Note that in this case the rule of singularity encircling is the opposite of the usual one. i.e., it is 
uj — > uj — iO, since t he ui of the (pk — uj) term in the denominator in the first line comes from the integration in uj' , 
that gives uj = —uj' [227| . Passing from the third to the fourth line, the variable of integration has been changed from 
uj to — uj in the last two terms of the third line, and we have also used the property D(uj,k) = D*(—u>,—k). The 
contribution of the second term of (|308p to Eq. (|307|) is obtained from (|315j) . Exploiting again the factors 6 k ,-k' and 
6(uj + uj'), this contribution is 



+00 



• E /<WoW(f w +&.- 1 )%=^ 

™ Jc D{uj, k) 



+OO 



dojfo(p) 



6(uj -p) 



duj ■ 



1 



D(u>,l) 

2fo(p)fo(v)6(p-u 



D(w,l) 



D*(w,l) (319) 

(320) 



We now have all elements for the determination of the right hand side of the Lenard-Balescu equation (|297p . 
Inserting again the 1/N factor we obtain the following result which is valid only in one dimension 



dt 



(1^ 



NdpJ^ \D(uj,1)\ 2 



t 1 ,df (p) ,df (uj) 
6{p-uj)[ f (uj)— f (p) 



dp 



duj 



= 0. 



(321) 



We thus see that, in one dimension, the Lenard-Balescu operator vanishes: the diffusion term (first term in the r.h.s.) 
is exactly balanced by the friction term (second term in the r.h.s.). Consequently the collisional evolution is due 
to terms of higher order in 1/N, and the Vlas ov e quation is valid for a longer time than previously expected. This 
remark was raised long ago in plasma physics [l92j | . 



The previous computation can be performed for a gener al (lo ng-range) two-body potential in d dimensions. The 



result is the following general Lenard-Balescu equation [2J 



r al (lo 
, 100] 



df(v,t) 7r(27r) d d 



dt 



dv 



dvidk k— 



u(\k\f 



|D(k,k.vi) 



r *(k-(v-vi)) 



k- ( /(vi,i)— — /(v,t) 



(322) 



where the boldface variables are d-dimensional vectors, and where u(|k|) is the real d-dimensional Fourier transform 
of the two-body potential; m is the mass of the particles and / is normalized to 1. The dielectric function D(uj,k) in 
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this general case is given by 



D(iu,k) = 1 + 



u(\k\) / dv 




(323) 



m 



c<j k ■ v 



The 1/N "smallness" of the right hand side of Eq. ()322|) is incorporated in the two-body potential. From this general 
expression we see that, while in d = 1 the collisional term vanishes, it will instead be present for d > 1. Thus, in 
a two-dimensional Coulombian plasma or for three-dimensional Newtonian interactions and in plasma physics, the 
Lcnard-Balescu collisional term gives a co ntrib ution already at order 1/N. Nicholson has derived its expression for 
general potentials for homogeneous cases |253j ; Chavanis has continued this line of research [98|, [9i| and obtained 
preliminary results for the inhomogeneous case [l02l , Il03l | . Note however that collective effects are not taken into 
account in the latter works. 

The Landau approximation, often used in plasma physics, corresponds to neglect collective effects in Eq. (|322p . 
which amounts to take Z?(w, k) = 1, so that the structure of the Landau equation does not depend on the potential. 

One notes that in a one-dimensional framework, the Lenard-Balescu and the Landau equations coincide: both of 
them reduce to the Vlasov equation. The collisional evolution is due to terms of higher order in 1/iV, implying that 
the collisional relaxation time scales as N s with S > 1. Therefore, the system can remain frozen in a stationary 
solution of the Vlasov equation for a very long time, larger than N. Only non-trivial three-body correlations can 
induce further evolution of the system, as also remarked in Ref. [92| for 2D hydrodynamics. This is the very reason 
of the so long relaxation time emphasized by numerical simulations [3 1 6l | . 



For the HMF model, but this is also true for any one-dimensional long-range system, we have thus proven that 
Vlasov stable homogeneous distribution functions do not evolve on time scales of order smaller or equal to N. 

The ab ove r esult is an illuminating explanation of the numerical strong disagreement which was reported in 
Refs. [ll], l218| | between constant energy molecular dynamics simulations and canonical statistical mechanics cal- 
culations. For energies slightly below the second order phase transition energy (see Fig. I32|) . numerical simulations 
in the microcanonical ensemble show that the system is trapped for a long time, whose duration increases with the 
number N of particles, in a state far from that predicted by equilibrium statistical mechanics. Since the latter were 
initially derived using the canonical ensemble, these results had been believed, at first, to be the fingerprint of in- 
equivalence between microcanonical and canonical ensembles. However, although the interaction is long-range, both 
ensembles lead to the same results for the HMF mod el, where only a second order phase transition occurs as shown in 
Sec. HVDl More careful numerical experiments [316j ] have revealed the tendency of the simulation points, i.e., of the 
out-of-cquilibrium state in which the system is trapped, to lie on the continuation to lower energies of the supercritical 
branch with zero magnetizati on o f the caloric curve. These states have been called quasi-stationary states (QSS). 
More systematic simulations |316| have determined a N 1 - 7 scaling law for the duration of the QSS, at the end of 
which the system eventually evolves towards the Boltzmann-Gibbs equilibrium state. In Fig. 1331 we display the time 
evolution of the magnetization, m(t), with increasing particle number, showing the increase of the duration of the 
QSS: the power-law increase is evidenciated by the choice of the logarithmic scale in the abscissa. Since this scaling 
law has been found when the system is initially prepared in a state that is a stable stationary solution of the Vlasov 
equation (in particular, for initial states homogeneous in 9), this numerical evidence is in agreement with the result 
derived above, that Vlasov stable homogeneous distribution functions do not evolve on time scales of order smaller or 
equal to N. Obviously, even if the initial state is Vlasov stable, finite N effects will eventually drive the system away 
from it and towards the Boltzmann-Gibbs equilibrium state. 

If the initi al co ndition is Vlasov unstable, a rapid evolution will take place. Simulations have been performed also 
in this case |218l |. showing that, after an initial transient, the systems remains trapped in other types of QSS, with 
different scaling laws in N for their duration. The existence of an infinite number of Vlasov stable distributions is 
actually the key point to explain the out-of-equilibrium QSS observed in the HMF dynamics. Let us show that the 
system evolves through other stable stationary states. In order to check the stationarity and the stability of an initial 
distribution fo(0,p), it is possible to study the temporal evolution of the magnetization, which is constant if the 
system is stable and stationary. Other possible macrovariables are the moments of the distribution function. It can 
be easily shown that any distribution of the form f(d,p,0) = fo(6,p) = F{p 2 /2 — m x cos0 — m y smd) = fo(e), with 
generic F, is a stationary solution of the Vlasov equation, provided m x and m y are determined self-consistently from 
Eq. (|273p . Obviously this does not yet mean that this solution is also stable. This latter observation has suggested 



B. Quasi-stationary states, diffusion and entropies 



1. Numerical evidence of quasi-stationary states 
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FIG. 32: Caloric curve of the HMF Hamiltonian (|126|l . The solid line is the equilibrium result in both the canonical and the 
microcanonical ensemble. The second order phase transition is revealed by the kink at e c g = 3/4. The three values of the energy 
indicated by the vertical lines are the stability thresholds for the homogeneous Gaussian (dashed), power-law of Eq. (|295|) 
with v = 8 (dash-dotted) and water-bag (dotted) initial momentum distribution. The Gaussian stability threshold coincides 
with the phase transition energy. The points are the results of constant energy (microcanonical) simulations for the Gaussian 
(losanges), the power-law (squares) and the water-bag (triangles). Simulations were performed with N = 5000. 




FIG. 33: Time evolution of the modulus of the magnetization m(t) for different particle numbers: N = 10 3 , 2.10 3 , 5.10 3 , 
10 4 and 2.10 4 from left to right (e = 0.69). In all cases, an average over several samples has been taken. Two values of the 
magnetization, indicated by horizontal arrows, can be identified in this figure: the upper one (labeled BG) corresponds to 
the expected equilibrium result for the magnetization, while the lower one, labelled QSS, represents the value of M in the 
quasi-stationary state. 



to study numerically the stability of /o by checking the stationarity of the first few moments //„ = (e n ) N . Since the 
stationarity of the moments is a necessary condition for stability, the vanishing, for a long time lapse, of the time 
derivatives fi n = dfi n /dt, for n = 1, 2 and 3, has been used as a numerical suggestion that the system is in a QSS and 
that the distribution fo(0,p) is a stable stationary solution of the Vlasov equation. On the contrary, large derivatives 
clearly indicate a non-stationary state. 

Figure [3~4"l presents the temporal evolution of these quantities, together with the temporal evolution of the modulus m 
of the magnetization, for power-law [y = 8) and gaussian initial distributions in the case of an energy e in the interval 
[£pi,£g]. In the stable case (a), the stationarity holds throughout the computed time since one notices that the three 
quantities fi n have vanishing small fluctuations around zero. On the contrary, in the unstable case (b), the system is 
first in an unstable stationary state (r < 0.0005), before becoming non-stationary (0.0005 < r < 0.003) and finally 
reaches stable stationary states (t > 0.003). Consequently the system evolves among different Vlasov stationary 
states. In the stable case (a), the magnetization m stays around zero before taking off around r = 20 to reach 
the equilibrium value m* . In the unstable case, Fig. I34f b) shows that after experiencing unstable stationary and 
non-stationary states, the system presents a slow quasi-stationary evolution across the infinite number of stationary 
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FIG. 34: Time evolution of the magnetization m. The time is rescaled as r = t/N. The quantities fi n (n = 1,2,3), which 
detect the stationarity, are also plotted (multiplied by a factor 100 for graphical purposes). The equilibrium value of the 
magnetization is m* . Panel (a) corresponds to a stable homogeneous initial distributions with power-law tails, while panel (b) 
shows an unstable initial condition with gaussian tails. In both panels, energy is e = 0.7. All numerical curves are obtained by 
averaging over 20 initial conditions with TV = 10 4 . 



and stable Vlasov states. In Ref. [3 161 ] . a careful numerical study has shown that this slow characteristic timescale 
associated to this final relaxation toward the Boltzmann-Gibbs equilibrium is proportional to TV 1 . However, this 
law might be dependent on the energy e and on the initial distributions. The previous subsection allows however to 
claim that this relaxation timescale is larger than ./V at least. This is thus a very slow process in comparison to the 
relaxation from an initially unstable state. Recently, this slow evolution of the HMF system through different stable 
stationary states of the Vlasov equation has been more systematically studied, with the aim to determine how the 
Vlasov stable solutions characterizing the system during the out-of-equilibrium dynamics can be parametrized [7l|, . 
It has been found that, starting from a homogeneous distribution given by a g-Gaussian with compact support, at 
an energy slightly below the second order phase transition energy, the evolution of the system during the QSS is 
well approximated by distribution functions of the same type, but with varying <j, until the system heads towards 
Boltzmann-Gibbs equilibrium. 

Let us stress tha t the above scenario is consistent with what happens generically for systems with long-range 
interactions (38l. l94l . |234| . In a first stage, called violent relaxation, the system goes from a generic initial condition, 
which is not necessarily Vlasov stable, towards a Vlasov stable state. This is a fast process happening usually on a 
fast timescale, independent of the number of particles. In a second stage, named collisional relaxation, finite TV effects 
come into play and the Vlasov description is no more valid for the discrete systems. The timescale of this second 
process is strongly dependent on TV. One generally considers that it is a power law TV 5 . A typical example is the 
Chandrasekhar relaxation time scale for stellar systems, which is proportional to TV/ In TV. This scenario of the typical 
evolution of long-range systems is summarized in Fig. [35] 

It is important to remark that, recently, Caglioti and Rousset [70j rigorously proved that for a wide class of 
potentials, particles starting close to a Vlasov stable distribution remain close to it for times that scale at least like 
TV 1 / 8 : this result is c onsistent with the power law conjectured for collisional relaxation. Unfortunately, apart from a 
recent progress |l72j . very few rigorous results exist in the case of singular potentials, which would be of paramount 
importance for Coulomb and gravitational interactions. 

Stron ger divergences with system size in long-range systems are observed in connection with metastable states [i~3l . 
HE l248| where the relaxation time increases exponentially with TV. 

In summary, quasi-stationary states observed in the TV-particle dynamics of the HMF model are nothing but Vlasov 
stable stationary states, which evolve because of collisional, finite TV, effects. There is an infinity of Vlasov stable 
homogeneous (zero magnetization) states corresponding to different initial velocity distributions fo(t = 0,p), whose 
stability domain in energy are different. The q-Gaussians in momentum homogeneous distributions are Vlasov stable 
stationary states in a certain energy region where QSS are observed in the HMF model. However, they are not special 
in any respect, among an infinity of others. In the HMF model at finite TV, all of them converge sooner or later to the 
Boltzmann-Gibbs equilibrium. However, the relaxation time is shown numerically to diverge with a power-law TV 5 , 
with 5 ~ 1.7 for the homogeneous water-bag state. 

The time scale r = t/N is thus the appropriate one to study momentum autocorrelation functions and diffusion 
in angle. We will consider such issues in the next subsection, where weak or strong anomalous diffusion for angles is 
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FIG. 35: Schematic description of the typical dynamical evolution of systems with long-range interactions. t v and r c are the 
violent relaxation and the collisional relaxation timescales, respectively. 

predicted, both at equilibrium and for QSS. 

2. Fokker-Planck equation for the stochastic process of a single particle 

Let us now consider the relaxation properties of a test particle, initially with a momentum p\ and an angle 8\, 
immersed in a homogeneous background of N particles; the latter is consequently a thermal bath, or a reservoir, for 
the test particle. The description of the motion for a so-called test particle in a system with identical particles is a 
classical problem in kinetic theory. Initially the test particle is in a given microscopic state, while the other particles 
are distributed according to the distribution fo and the test particle is assumed not to affect the reservoir. The 
interaction with the fluid induces a complicated stochastic process. 

The test particle distribution is initially not in equilibrium, and not necessarily close to it. However, it is natural 
to expect that the distribution of the test particle will eventually correspond to the distribution of the bath generated 
by all the other particles. How it evolves from the initial Dirac distribution fi(0,p, 0) = 5(9 — 9i)6(p — p{) toward the 
equilibrium distribution is thus of high interest. We will show that the distribution is a solution of a Fokker-Planck 
equation that can be derived analytically. This equation describes the dynamical coupling with the fluctuations of the 
density of particles, which induce fluctuations in the potential: this is the origin of the underlying stochastic process. 

We analyze therefore the relaxation properties of a test particle, indexed by 1, surrounded by a background system 
of (N — 1) particles with a homogeneous in angle distribution fo(p). The averaged potential (v) still vanishes for a 
homogeneous distribution, so that the particle only feels the fluctuations of the potential, that, according to Eq. (|272p . 
is given by 5v(9)/y/N. The potential felt by the test particle at the position 9\(t) is therefore 6v(0)/y/~N computed 
at 9 = 9\(t). We thus expect that the instantaneous force on the test particle will be of order 1/yN. The equations 
of motion of the test particle are therefore 

d9i , dpi 1 d6v(9,t) 

—— = pi and — — = = — 

dt dt ^/n 09 

the integration of which leads to (omitting the index 1 for the sake of simplicity) 

1 f Ul d6v 

9(t) = 6(0)+p(0)t--= / dm / du 2 — (9(u 2 ),u 2 ) (325) 
VN Jo Jo 

p(t) = p(0) - J= J d«^((9(«), u) , (326) 

where, again for simplicity, we have indicated directly inside the dependence of 5v the test particle variable. The key 
point of this approach is that we do not limit the study to the usual ballistic approximation, in order to have an 
expansion exact at order 1/N. Therefore it is of paramount importance here to treat accurately the essential collective 
effects. 



(324) 

e=fli (t) 



77 



By introducing iteratively the expression for the variable 6 in the right-hand-side of Eq. (|326[) and by expanding 
the derivatives of the potential, one gets the result at order 1 /N of the momentum dynamics 



P(t) 



1 /"* dSv 
K0)-^= J du—(8(0)+p(0)u,u) 

r r Ul dSv 

J Q du iJ Q du 2 — (6(0)+p(0)u2,u 2 ). 



d 2 5i 



~ f du^(0(O)+p(O)u,u) 



(327) 



As the changes in the momentum are small (of order l/VN), the des cript ion of the momentum dynamics is well 
represented by a stochastic process governed by a Fokker-Planck equation [305| . If we denote by f\ (p, t) the distribution 
function at time t of the test particle momentum, then the general form of this equation is 



dfiM 

at 



d_ 

dp 



with 



A(p,t) = lim-((p(t + T)-j>(*)))p(t)=p 

r— n) T 

B(p,t) = Yim^((p(t + r)-p(t)) 2 ) p{t)=p , 



(328) 

(329) 
(330) 



where the expectation values are conditioned by p(t) — p. This equation is therefore characterized by the time 
behavior of the first two moments, called Fokker-Planck coefficients. We approximate these two coefficients by an 
expression which is valid in the range of time t defined by 1 -C t <C N. In this time range, using a generalization 
of formula (|3 13[) that takes into account that the initial coordinates of the test particles are given, it is possible to 
obtain (see Appendix E) 



A(p,t) 
B(p,t) 

where the diffusion coefficient is given by 



1 

N 



dD 
dp 



(P) 



±dfo 
fo dp 



Dip) 



D(p) = 2Re / dte lpt (6v{l,t)6v(-l,0)) = n 



fo(p) 



D(p, 1) 

Substituting (|33 1[) and (|332[) in the general form of the Fokker-Planck equation (|328p . one ends up with 



(331) 
(332) 

(333) 



at 



1__9 

N dp 



Dip) 



a/i(p,t) i df 



dp 



fo dp 



(334) 



We thus recover what has been established in plasma physics (see e.g. Ref. [185]): the evolution of the velocity 
distribution /i (p, t) of the test particle is governed by a Fokker-Planck equation that takes a form similar to the 
Lcnard-Balescu equation (|32 1[) provided that we replace the distribution /o (p, t) of the bath by f% (p, t) . The integro- 
diffcrcntial equation is thus transform ed in the Fokker-Planck differential equation. Similar results in higher dimension 
have been obtained later in Refs. [9^, llOOl ] . 

For one-dimensional systems, it has been shown in Sec. IV A"5l that the Lenard-Balescu collision term cancels out so 
that the distribution function does not evolve on a time scale of order N. Since, on the other hand, the Fokker-Planck 
equation (|334j) shows that the relaxation time of a test particle towards the distribution of the bath is of order N, 
this implies that we can assume that the distribution of the particles f is stationary when one studies the relaxation 
of a test particle. This is true for any distribution function /o(p) that is a stable stationary solution of the Vlasov 
equation. This is not true in higher dimensions, except for the Maxwellian distribution. 

Equation (|334p emphasizes that the momentum distribution of particle 1 evolve on timcscales of order N. We will 
thus introduce the timescalc r = t/N , so that the Fokker-Planck equation can be rewritten as 



d_h 

dr 



a_ 

dp 



I dfx 1 df 
^ ' ~dp ~ h~dp 



(335) 
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valid for times r at least of order 1. We see that the time derivative of fi vanishes if the distribution function 
fi of the test particle is equal to the quasi-stationary distribution / of the surrounding bath. We expect that /i, 
governed by this Fokkcr-Planck equation, will converge to fo in a time r of order 1: this means that the distribution 
function f\ of the test particle converges towards the quasi-stationary distribution fo of the surrounding bath. Thus, 
it does not converge towards the equilibrium Gaussian distribution, in complete agree men t with the result that fo is 
stationary for times scales of order N. This result is not valid in higher dimensions [lOCj . It is important to stress 
here that collective effects are taken into account through the presence of the dielectric response function D(p, 1) in 
the denominator of the diffusion coefficient (see Eq. (|333p ). 

Analyzing the stochastic process of equilibrium fluctuations in the particular case of homogeneous Gaussian distri- 
bution, Bouchet [5^ | derived the diffusion coefficient of a test particle in a equilibrium bath. His result is recovered 
when considering a homogeneous Gaussian distribution (|292[) in expression (|333|) since one gets 



D(p) 



2 



1 

2tv 



§_ e -/?P 2 /2 
2tt 



0S/2 pe -frf/i 



~/2dt 



(336) 



-/V 



plotted in Fig. | 
tails D{p) ~ 0r/?/8 



It is important to stress that such an expression leads to a diffusion coefficient with Gaussian-like 

e -/3p 2 /2. 




FIG. 36: Diffusion coefficient -D(p) in the case e 
(dashed line). 



2 for a Boltzmann thermal bath (solid line) and a waterbag distribution 



The above general derivation [6l| allows however to study any arbitrary distribution. Let us carry on here the 
calculation of the diffusion coefficient for the Vlasov-stable water bag distribution (|293[) . It has an interesting behavior, 
since the dielectric response D{lo, 1) has zeroes on the real axis, contrary to any even distributions strictly decreasing 
for positive values of the frequency u>. One gets 



Dip) 



^r [Qip + po)-eip- P o)} 



1 

1 — 



2P 2 -P 2 



^- o (S(p + Po) 



Sip-po)) 



(337) 



which is also plotted in Fig. [36j One can obtain similar results for g-exponential distributions (|294[) (see Ref. |lld |) 
or power-law tails distributions (|295|) . 



3. Long-range temporal correlations and diffusion 



Since the Fokker-Planck equation for the single particle distribution function flip) has a variable diffusion coefficient, 
the relaxation towards the Boltzmann distribution can be slowed down, especially if the diffusion coefficient decreases 
rapidly with momentum. It is thus important to study this Fokkcr-Planck equation for different distribution functions 
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of the bath. A similar study was earlier performed for 2D vortices in Ref. [92J. One consequence is that velocity 
correlation functions can decrease algebraically with time instead of exponentially, a behavior which might lead to 
anomalous diffusion as we will show below. 



By introducing the appropriate change of variable x = x(jp), defined by dx/dp = 1/y D(p), and the associated 

distribution function /i, defined by /i(r, x)dx = fi(r,p)dp, one can map [HI, [M EH [M EH the Fokker-Planck 
equation (|335|) to the constant diffusion coefficient Fokker-Planck equation 



dfi = d_(dfi 
dr dx \ dx 




(338) 



where the potential ip(x) is given by 



i>{x) = -\n(yD(p(x))f (p(x))). (339) 



Using the property D(p, 1) P ^°° 1, that implies, by Eq. ()333|) . that D(p) p ^°° 7r 2 / (p), one gets 4>( x ) x ^°° 
— | \nfo(p(x)). From this, one derives that for many classes of distribution functions /o, the potential ip{x) is 
asymptotically equivalent to a logarithm. In fact, we have 

^{ x ) x ^°° a \xy\x\, (340) 

with a = 3 if fo(p) decreases to zero more rapidly than algebraically for large p, and a < 3 if fo(j>) decreases to zero 
algebraically; more precisely, a = 3v/(2 + v) if fo(p) decays at large p as p~ u ■ For weakly confining potentials ip( x ), 
i.e., when v < 1 and thus a < 1, Eq. (|338p has a non-normalizablc ground state. The example of the heat equation, 
which corresponds to ip(x) = 0, describes a diffusive process leading to an asymptotic self-similar evolution. In such 
a case, the spectrum of the Fokker-Planck equation is purely continuous. By contrast, a strongly confining potential 
ip(x) (for instance, the Ornstein-Ulhenbeck process with a quadratic potential) would lead to exponentially decreasing 
distributions and autocorrelation functions, linked to the existence in the spectrum of a gap above the ground state. 
The logarithmic potential (|340|) is a limiting case between the two behaviors. The normalizable ground state is unique 
and coincides with the bottom of the continuous spectrum. The absence of a gap forbids a priori any exponential 
relaxation. To illustrate this result, we evaluate the asymptotic behavior explicitly in two cases. 

• Let us first consider distribution functions fo(j>) with fast (more than algebraically) decreasing tails so that 

/o(p) IPl ~ ° Ccxp(-/?p 4 ), (341) 
which includes not only the Gaussian (5 = 2) and exponential tails (5 = 1), but also stretched-exponcntial ones 



with any arbitrary positive exponent 5. From the change of variable dx/dp = l/y\D(p), asymptotic analysis 

leads to p(x) ' ~ (2 In |.t|//3) 1 ^ 5 and to ip(x) x ^7°° 31n|x|. These estimates are sufficient (see Ref. [6l| for 
details) to evaluate the long time behavior of the momentum autocorrelation function 

<p(r)„(0)> T ^ ^!L, (342) 

T 

which proves the existence of long-range temporal momentum autocorrelation for all values of 5, and therefore 
also in the case of Boltzmann equilibrium, 6 = 2. 

Let us now consider a distribution function fo(p) with algebraic tails 



fo(p) lP ~°°C\pr. (343) 



In this case, one has p(x) X C'x 2 ^' 2+l '^ and the asymptotic behavior (|340[) with, as we said, a = 3u/(2 + v). 
We consider only cases where v > 3, to ensure that the second moment of the distribution f (i.e., the average 
kinetic energy) does exist. For v > 3 one has that 9/5 < a < 3. The result for the momentum autocorrelation 
function is 



(p(r)p(0)) ' 7t° r^ 3 -")/^) , (344) 



which characterizes an algebraic asymptotic behavior. 
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From the momenta autocorrelation, one usually derives the angle diffusion ((6(t) — 0(O))2) = 2Dqt where Dg is 
defined via the Kubo formula 

r+ao 

D e = dr (p(t)p(O)) . (345) 

Jo 

However, since the exponent (3 — v)/(2 + v) = — 1 + 5/(2 + v) is larger than —1, the asymptotic result (|344j) shows 
that the integral (|345|) diverges. The asymptotic result (|342[) leads also to a divergent integral, although less singular. 
It is thus natural to expect anomalous diffusion for the angles. This extremely small anomaly (logarithmic) for 
distributi on functio ns with Gaussian or stretched exponential tails induced difficulties to detect numerically anomalous 
diffusion (217ll3li |. 

Note that, generalizing the theory of Potapenko et al. [268| . Chavanis and Lemou jl05| studied how the structure 
and the progression of the distribution function tails, also called fronts, depends on the behavior of the diffusion 
coefficient for large velocities. They showed that the progression of the front is extremely slow (logarithmic) in that 
case so that the convergence towards the equilibrium state is peculiar. 

Using the time rescaling r = t/N, which introduces a factor 1/iV 2 , the angles diffusion ct 2 (t) = ([9(t) — 6>(0)] 2 ) can 
be rewritten as 

^ = J\rxJ\r 2 {p{^)p{r 2 )) (346) 

pT fT—S 

= 2 ds dr 2 (p(s + t 2 ) P (t 2 )) , (347) 

Jo Jo 

in which the new variable s — t\ — t 2 has been introduced to take advantage of the division of the square domain into 
two isoscale triangles corresponding to s > and s < 0. In the quasi-stationary states, the integrand (p(s + t 2 )p (t 2 )) 
does not depend on t 2 (the QSS evolves on a time scale much larger than N) and hence diffusion can be simplified |314| 

as 

^ = 2 £ d S (r- S )(p(s)p(0)) . (348) 

A distribution with power law tails (|343[) will therefore correspond to 

((0(t) - 6(0)) 2 ) T H°° t 1+ ^. (349) 



A comparison of this predicted 61| anomalous diffusion for angles with direct numerical computation of the HMF 
dynamics is a tough task because of the scaling with N of the time dep ende nce of the autocorrelation function. 
However, it has been recently confirmed by numerical simulations in Ref. [317| . For the stable case (e = 0.7) with 
power-law tails (y = 8), the theory predicts that the correlation function decays algebraically with the exponent —1/2 
(see Eq. (|344p ). Figure IBTT a) shows that the theoretical prediction agrees well with numerical computations. The 
expression Q349P can be rewritten in that case as <r 2 (r) ~ r 3 / 2 : Figure l3"?Tb) . in which the four curves for the four 
different values of N almost collapse, attests also the validity of this prediction. This is a clear example of anomalous 
diffusion. 

For the stretched exponential distribution function (|341[) . one ends up with 

((0(t) - 9(0)) 2 ) T lx°° r(lnr) 2 / m . (350) 

As for power law tails, the diffusion is again anomalous, although with a logarithmically small anomaly. Consequently, 
the anomalous diffusion for angles also occurs for the Gaussian distribution which corresponds to the specia l case 8 = 2. 
This weak anomalous diffusion, i.e. normal diffusion with logarithmic corrections, has also been confirmed [317j |. Since 
Gaussian distributions correspond to equilibrium distributions in the microcanonical and the canonical ensembles, it is 
important to realize that anomalous diffusion might thus be encountered for both equil ibrium and out-of-equilibrium 
initial conditions. An analogous behavior has been observed for point vortices (941 . Il06l |. 



4- Lynden- Bell's entropy 



In subsection I V B 1 1 we have discussed in detail, focusing on the HMF model, the two-stage relaxation process that 
is often observed in systems with long-range interactions. It has been indeed realized, beginning with a seminal paper 
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FIG. 37: Check of the theoretical prediction for stable initial distributions with power-law tails, in the case s = 0.7. Points are 
numerically obtained by averaging 20, 20, 10 and 5 realizations for N = 10 3 , 10 4 , 2.10 4 and 5.10 4 respectively. In panel (a), four 
curves represent the correlation functions of momenta, while the straight lines with the slope —1/2 represent the theoretical 
prediction. The curves and the lines are multiplied from the original vertical values by 2,4 and 8 for N = 10 4 ,2.10 4 and 
5.10 4 for graphical purposes. Similarly, panel (b) presents the diffusion of angles, while the straight line with the slope 3/2 is 
theoretically predicted. The four curves for the four different values of N are reported and almost collapse. 



by Henon [174l | on globular clusters, that the dynamical evolution is divided in two well separated phases. A first 
phase, called dynamical mixing, where an initial fast (violent) evolution leads to a quasi- stationary state, and a second 
phase, called relaxation phase, where "collisions" have the cumulative effect to drive the systems towards statistical 
equilibrium. If the number of particles is large, the two phases are well sepa rated in time. It was also observed that the 
quasi-stationary state was strongly dependent on the initial condition 174j . In subsection lV A 41 we have discussed the 
existence of stationary stable and unstable one-particle distributions for the Vlasov equation and we have presented 
(subsection IV B l| ) the interpretation of quasi-stationary states in terms of "attractive" Vlasov equilib ria. A statistical 
approach that explains the existence of Vlasov equilibria has been proposed long ago by Lynden-Bell [23dj | . He begins 
by remarking that Vlasov equation, which represents the evolution of an incompressible fluid, obeys Liouvillc theorem 
in six dimensions. We will here restrict to a two-dimensional phase space for simplicity. This implies that the "mass" 
of phase elements between / and / + df is conserved (remind that / is the one-particle distribution function). If 
we discretize the one-particle distribution function into a set of k quantized levels rji, i = 1, . . . ,k, this means that 
the area corresponding to the i-th level m(r]i) = J dddpS(f(9,p,t) — rji) is a constant of the motion. In the limit of 
a continuous distribution of levels, one obtains an infinity of conservation laws. It is easy to prove that this implies 
that any functional of the form J dddpC(f) is conserved: these functionals are called Casimirs. For instance, Gibbs 
entropy is a particular Casimir: Sabbs = ~ J dOdpflnf. This specifically implies that, in terms of the fine grained 
one-particle distribution, Gibbs entropy cannot increase. Let us then define a coarse grained distribution 



(351) 



where p{9,p, rji) dOdp is the probability of finding level rji in the phase-space macroccll D macro = [6, 6 + d9] x [p,p + dp\. 
Obtaining this probability directly from the microscopic dynamics is an extremely difficult problem, and its solution 
would constitute a significant step forward in the understanding of the natur e of quasi-stationary stat es. Lynden-Bcll's 
proposal is to evaluate p{6,p,rji) by using a sort of "Boltzmann principle" [l4l[ 12431 1274 I275I . |276| . The evaluation 
of p{9,p,r]i) can be done for any number of levels k, but to make the calculation simpler, let us approximate the 
one-partic le d istribution with only two levels rji — and 772 = fo (the case with many levels has been treated in 
Refs. [97l. |230( ). Alternatively, one can consider a fine-grained distribution function which, at t = 0, has only two 
levels: these are called water-bags in astrophysics and plasma physics. The exact time evolution of the water-bag is 
such that the shape of the area occupied by the Vlasov "fluid" with level fo is deformed, stretched a nd fo lded due to 
the hyperbolicity originated by the nonlinear dynamics of the Vlasov equation (see e.g. Fig. 2 in Ref. |230l |). However, 
due to Liouville's theorem, this area is conserved. Let us divide each macrocell D macro into v microcells of volume 
ui and consider a microscopic configuration in which the i-th. macrocell is occupied by rii microcells with level fo and 
v — ni with level zero. The total number of occupied microcells is Af, such that the total mass is m = Afuifo- This 
latter is also equal to the normalization of the fine grained distribution m — J d9dpf(9,p). The Af occupied microcells 
are first placed into macrocells. There are AO/IXjTij! wavs to do this. Within the i-th cell, one can distribute the 
first of the rii occupied microcells in v ways, the second in v — 1 and so on. The number of ways of assigning the n. 
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occupied microcclls is thus v\/{v — rii)\. Then, the total number of microstates compatible with the macrostate where 
n,j microcells are occupied in macrocell i is given by the product of these two factors 

TV! „ v\ 



W({ ni }) 



n, 



n 



(v-th)\ 



(352) 



The first factor is calculated exactly as for Boltzmann gas [183l | . because the occupied microcells are distinguishable, 
while the second factor reminds Fcrmi-Dirac statistics and derives from an exclusion principle, which is a consequence 
of fluid incompressibility: one microcell cannot be occupied more than once by a fluid element of level fo. Apart from 
this latter constraint, fluid microcclls arc let to distribute freely among the different macrocells: this corresponds to 
making an assumption of ergodicity. This doesn't happen for the true Vlasov dynamics and is sometimes referred to 
as the hypothesi s of efficient mixing. Indeed, it has been found that dynamical effects can hinder mixing 
HE) HH 12231 |224| . Using Stirling's approximation and expressing m in terms of the average probability to find level 
fo in cell i, Pi(fo) = ni/v, one obtains, neglecting an additive constant, 



In W = v ^2 Pi ln /°i + 0- - Pi) M 1 - Pi), 



(353) 



which can also be rewritten in terms of the coarse grained distribution function, pi 
limit J2i ~ * / d6dp/(uu), one finally gets 



s LB [f\ 
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Following the standard procedure, inspired by large deviation theory [3001 ] . one then maximizes slb subject to the 
constraint of conserving energy E, mass m and other global invariants like momentum (or angular momentum for 
higher dimensions). 

Historically, the first applications of Lynden-BclPs ideas encountered both confirmations and failures [119L Il76l 1 180L 

|318| , although the crucial point of necessarily performing simulations with a large number of particles was never clearly 
addressed, due to computer time limitations. Very recently, a careful analysis of the one-dimensional self-gr avita ting 
sheet model has shown that Lynden-Bell statistics applies for initial data with virial ratio close to unity |315l ] . A 
statistical theory similar to Lynden-Bell's was independently developed for the Euler equation [245ll275ll277l | and, later 
on, the deep analogy between the Vlasov-Poisson system and the Euler equation was for the first time clearly stressed 
in Ref. [l09j ] . This approach was then used in the context of mean- field models in Refs. [l|| [Hj], with an application 
to the Colson-Bonifacio model of the Free Electron Laser (see subsection IIV E 6|) . As already discussed in section 
IV B 11 th e presence of quasi-stationary states for the HMF model was recognized and characterized numerically in 
Ref. |316(. Lynden-Bell's theory was then shown to predict the main features of the one-particle distribution function 
of quasi-stationary states of the HMF model in Ref. [l7|. Let's finish this subsection by discussing in some detail this 
latter result, which clearly shows the power of Lynden-Bell's approach. 
Let us first recall the Vlasov equation for the HMF model 



dj_ dl_dVdl = 
dt P d9 d9 dp 

where f(9,p,t) is the fine grained one-particle distribution function and 

V(0)[f] = l-M x [/]cos(0)-M tf [/]sin(0) 

-7T Z'+OO 



/ + r-\-OC 
/ f (9, p,t) cos 9d9dp 
-7T J —OO 

/ + 7T /'+OC 
/ f (9, p,t) sin 9d9dp 
-7T J —OO 



The globally conserved quantities are energy 



Kf] 



f(9,p,t)d9dp 



Ml 



Ml 



and momentum 



P[f]= Pf(9,p,t)d9dp 



(355) 

(356) 
(357) 

(358) 

(359) 
(360) 
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As for the initial distribution, we consider a water bag with rectangular shape in the (9,p) plane. The distribution 
/ takes only two distinct values, namely /q = l/(4A0Ap), if the angles (velocities) lie within an interval centered 
around zero and of half- width A9 (Ap), and zero otherwise ("mass" m is normalized to one and momentum P[f] is 
zero) . There is a one to one relation between the parameters A9 and Ap and the initial values of magnetization and 
energy 

sin(Afl) (Apf l-(Mp) 2 
Mo = ^9- ' £ = — + 2 ' (361) 



While h[f] — e and P[f] = are constants of the motion, magnetization M = y M 2 + M 2 evolves with time. 
Lynden-Bell's maximum entropy principle is then defined by the following constrained variational problem 

s(e) = max (s(f) h(J) = e; P(J) = Q:J d6dp / = 1 V (362) 

The problem is solved by introducing three Lagrange multipliers /3//o, A//o and fJ,/fo for energy, momentum and 
mass normalization. This leads to the following analytical form of the distribution 

■^ 9,P ^ ~ 1 + cxp [(3(p 2 /2 - M y [/] sin 6 - M x [/] cos 9) + Xp + p] ' 

This distribution differs from the Boltzmann-Gibbs one because of the "fermionic" denominator. Inserting expression 
(|363[) into the energy, momentum and normalization constraints and using the definition of the magnetization, it 
can be straightforwardly shown that the momentum multiplier vanishes, A = 0. Moreover, defining x = and 
m = (cos 9, sin 9), yields the following system of implicit equations in the unknowns /3, x, M x and M y 



xe 0M - m ) = 1 (364) 



_£_ f d9 e pm. m p 2 (a;e /3M. m) = e + M^_J_ (3g5) 

(366) 
(367) 



'2/3 3 / 2 

fo4pjd6 cos 9 m F (xe^) = M x 
fo^= J d8 sm9e? M m F (xe^ M m ) = M y 



withFo(y) = / cxp(-i> 2 /2)/(l + yexp(-v 2 /2))dv and F 2 (y) = J v 2 exp(-w 2 /2)/(l + yexp(-v 2 /2))dv. This system of 
equations can be solved numerically and, given the parameters that fix the initial conditions, i.e. energy, momentum 
and Mq, univocally determines the values of the Lagrange multipliers (3 and p, the values of M x and M y where Lynden- 
Bell's entropy is extremal and, finally, the distribution / in formula (|363|) itself, with no adjustable parameter. For 
e = 0.69, the maximum entropy state has zero ma g neti zatio n for Mq < M cr n = 0.897, a value at which Lynden-Bell's 
theory predicts a second order phase transitions [l7], EH, 1 1 OOj ] . Interpreting / as the distribution in the quasi-stationary 
state (QSS), we obtain in this case 

/ = Iqss (p) = — r {° 2/r) , — i , (368) 

1 + exp [pp 2 /2 + fi\ 

with (3 and (i to be determined from the knowledge of Mq. Velocity profiles predicted by (|368[) are displayed in Fig. |3"81 
for different values of the initial magnetization. 

Although not a single free parameter is used, one finds an excellent qualitative agreement. The presence of two 
symmetric bumps in the velocity distributions is not predicted by Lynden-Bell's theory and is a consequence of a 
collective phenomenon which leads to the formation of two clusters in the (6,p) plane. This is shown by the direct 
simulation of the Vlasov equation (|355[) presented in Fig. [39j The bumps represent an intrinsic peculiarity of QSS 
and have been characterized dynamically in Ref . [22| . 

More recentl y, oth er authors have obtained similar encouraging results for Lynden- Bell 's theory in a model of non- 
neutral plasma |223| and for radially symmetric solutions of a self-gravitating system 224]. They also find that, when 
dynamical effects lead to collective oscillations of the mean-field, Lynden-Bell's theory shows a disagreement with 
numerical data and propose a modification of the theo ry. W ork along this line is in progress. 

The theoretical approach proposed by Lyndcn-Bcll [230] allo ws to predict the presence of a phase transition line 
in the (Mo,e) and in the (/o,e) control parameter planes [3, Il0l| . At equilibrium the At equilibrium the phase 
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FIG. 38: Velocity distribution functions in the quasi-stationary state for the HMF model with e = 0.69 and different values of 
Mo. Symbols refer to numerical simulations, while dashed solid lines stand for the theoretical profile (|368[) . Panels a), b) and 
c) present the three cases Mo = 0.3, Mo = 0.5 and Mo = 0.7 in lin-log scale, while panel d) shows the case Mo = 0.3 in lin-lin 
scale. The numerical curves are computed from one single realization with N = 10 7 at time t = 100. 



transition does not depend on the choice of Mq and is located at e = 3/4. On the contrary, in th e out -of-equilibrium 
QSS the phase transition line is located at smaller energies joining the transition points (0, 7/12) |316l | and (1, 3/4) in 
the (M q, e) plane. Moreover the phase transition changes from first to second order. These latter results are resumed 
in Ref. flQ4j . 

The Lynden-Bell approach turns out to be a good way to attack the problem of quasi-stationary states. It gives 
predictions for both averages and distributions functions which compare quite well with numerical simulations. When 
the approach fails to describe detailed features, they are viable ways of modifying it taking into account dynamical 
properties. 



VI. GENERALIZATION TO NON MEAN-FIELD MODELS 



Although substantial progress has been done recently in the understanding of long-range interactions, the hardest 
questions about physical systems where the interactions weakly decay with the distance, and also shows singularities 
at short distances, remain open (see the introductory section HTT|) . In section HVl we have shown how simple mean- 
field models, that arc explicitly solvable in both the canonical and the microcanonical ensemble, already show several 
features previously encountered in gravitational systems, e.g. negative specific heat. These simple models also display 
peculiar out-of-equilibrium dynamical effects, like the presence of quasi-stationary states, which resemble those found 
for realistic systems, see section IVl 

In this section, we make a little step in the direction of the study of Hamiltonians that are not fully mean-field. In 
subsection IVI Al we introduce, for both a mean-field Ising and an XY model, a nearest-neighbour interaction term. 
We show that the phase diagram in canonical and microcanonical ensembles preserves the features of non equivalence 
found for pure mean-field Hamiltonians. In particular, we mention the non coincidence of the microcanonical and 
canonical tricritical points and the presence of negative specific heat. In subsection IVI B[ we consider an Ising model 
in one dimension with weakly decaying coupling and a modification of the HMF model which includes a coupling 
with the same properties. For both models, we show that the mean-field properties extend to the weakly decaying 
case and that one can obtain analytically the free energy and the entropy. Since the corresponding mean-field limit 
of both models shows ensemble equivalence, this property persists also for such models. In subsection IVI B 31 we 
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FIG. 39: Simulation of the Vlasov equation (|355[) that starts from a water-bag initial condition with e = 0.69 and Mo = 0.5. 
The final snapshot (lower right panel) is the quasi-stationary state 



show the analytical solution in both the canonical and microcanical ensembles of a self-gravitating system of particles 
moving in one dimension. The phase diagram is different in the two ensembles and show all features of ensemble 
inequivalence. In an appropriate limit, the model reduces to the HMF model and therefore ensemble inequivalence 
can be turned off by varying a parameter. Finally, in subsection IVI B 41 we conclude with a summary of a very 
recent result concerning a spin system with dipolar interactions whose Hamiltonian can be reduced in appropriate 
limits to that of the XY model with nearest neighbour and mean- field interactions presented in subsection IVI Al This 
result opens the possibility to verify experimentally some of the striking and counterintuitive features of long-range 
interactions. 



A. Systems with short- and long-range interactions: the transfer integral method 

In this subsection, we will present both the canonical and the microcanonical solutions of an Ising model and 
of an XY model with nearest neighbour and mean-field interactions. We will obtain the phase diagram in both 
ensembles. All the features of mean-field models discussed in Sec. IIVI will appear again in this context: ensemble 
inequivalence, negative specific heat, temperature jumps, ergodicity breaking. This shows that the addition of a short- 
range term to a long-range Hamiltonian does not remove the interesting behaviors described before. For what concerns 
methodological aspects, the use of transfer integral combined with the Hubbard-Stratonovich transformations and the 
min-max method allows us to treat both models with discrete and continuous variable. The models we will consider 
are defined on a one-dimensional lattice (indeed a ring because of the periodic boundary conditions), but there is no 
reason of principle that forbid to extend the calculation to strip and bar geometries. 



1. Ising model 

a. Introduction Nagle introduced an interesting Ising mo del combi ning long- with short-range interactions [25 
Further elaborations of the model were proposed by Khardar 194, 195]. The Hamiltonian is 



\i=l J i=l 



(369) 
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where Si = ±1. In this onc-dimcnsional spin chains, the first term has an infinite range and is the typical one of the 
Curie- Weiss Hamiltonian (UJ). This term is responsible for the non additive properties of the model, see Sec. Ill Al 
Note that the prefactor J present in Hamiltonian (fT]) has been set to one by an appropriate renormalization of the 
energy. There is no loss of generality, since only the ferromagnetic case J > will be considered. On the contrary, 
the second term of (|369[) corresponds to an interaction between nearest neighbors along a one-dimensional lattice 
(periodic boundary conditions are chosen). The coupling constant K might be either positive or negative. 

The ferromagnetic state with all spins up (Si = 1, Vi) or down (Si = — l,Vi) has a negative energy Ep — —N/2. For 
the antifcrromagnetic state with alternate signs of nearest neighbour spins, the first term of (|369[) gives a vanishing 
contribution to the energy and the energy is Ea = KN. At T = 0, one can determine if the model has a phase 
transition at some value of K by comparing the energy of the ferromagnetic state with that of the antiferromagnetic 
one, since only the energy term of the free energy matters. Hence, by imposing Ea = Ep one gets the phase transition 
value K t = —1/2 at which a discontinuity of the order parameter is found, from m = to m = 1. Therefore the 
transition is first order. 
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FIG. 40: Elementary features of the phase diagram of the short plus long-range Ising model showing the phase transitions on 
the temperature T and local coupling K axis, respectively. 



For non zero temperature, one has to take into account the entropic term of the free energy, which measures disorder. 
When the coupling constant K vanishes, one fully recovers the Curie- Weiss Hamiltonian (fT]), which exhibits a second 
order phase transition at T = 1. One therefore expects that the (T, K) phase diagram displays a transition line which 
is first order at low T and second order at high T (see Fig. HO)) . The transition line separates a ferromagnetic from a 
paramagnetic state, although exactly at T = and K < —1/2 the state is antiferromagnetic. Let us now determine 
analytically this transition line in both the canonical and the microcanonical ensemble. 

b. The solution in th e canonic al ensemble The phase diagram has been studied in the canonical ensemble by 
Nagle (250} and Khardar [Hi, [Hi]. The partition function is 



Z(J3,N) = 



E 



exp 



{Si,...,SW} 



{Si,...,S N } 



i+i 



i) 



(370) 



To get rid of the quadratic term, we use the Hubbard-Stratonovich transformation 

dec e 1=1 



(3N 
~2T 



(371) 
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so that the partition function (|3T0|) can be rewritten as 



N 



Z((3,N) 



(3N 



+ 00 

-OO 
+ OO 



dx 



PN 2 
e~— X 



N 



E 

{Si,...,S N } 



dx e 



-Nf3f (0,x) 



The free energy can be written as 



f(P,x) = -x 2 + f (p,x), 



(372) 
(373) 

(374) 



where fo(P, x) is the free energy of the n earest-neighbo r Ising model with an external field x. Such an expression can be 
easily derived using the transfer matrix an easy calculation, we find /o(/3, x) = — ln(A+ +A^)/(/3iV) 

where the two eigenvalues of the transfer matrix are 



A± = e pK/2 cosh{(3x) ± \[eP* sinh 2 (/3x) + e~' 3K . (375) 
As A+ > A_ for all values of x, only the larger eigenvalue A+ is relevant in the limit N — ► 00. One thus finally gets 



4>((3,x)=0f(P,x) = fx 2 -In 



^ K/ ' 2 cosh(/3x) + ^eP K smh 2 (/3x) +e-P K 



(376) 



which is shown in Fig. [¥T] for different values of the inverse temperature (3 and for two values of the nearest-neighbor 
coupling K. In the large iV-limit, the application of the saddle point method to Eq. (|373l) finally leads to the free 





x 



X 



FIG. 41: f{P,x) for different values of the inverse temperature. Left panel presents /3 = 1.1 (dashed line), (3 C — 1.4 (dotted), 
j3 — 2.5 (solid) when K — —0.25: a second order phase transition. Right panel shows the case K = —0.4 when /3 = 10 (dotted), 
fit — 2.4 (solid), fi = 2.35 (dash-triple dot), fi = 2 (dashed): a first order phase transition. Note that the different curves have 
been vertically shifted for readability purposes. 

energy, which is obtained by taking the value of x which minimizes cj>(P,x) in formula (|376[) . 

From the knowledge of the free energy, as anticipated, one gets either a second or a first order phase transition 
depending on the value of the coupling constant K. As usual, the expansion of /(/3, x) in power of x is the appropriate 
procedure to define the critical lines and points. One gets here 

~f{f3,x) = - In 2 cosh ^ + (1 - Pe? K ) + ^ K (3e 2 ^ - l) x 4 + 0(x 6 ). (377) 
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The critical point of the second order transition is obtained for each K by computing the value (3 C at which the 
quadratic term of the expansion (|377|) vanishes provided the coefficient of the fourth order term is positive, obtaining 
0c = exp (—(3 C K). When also the fourth order coefficient vanishes, 3exp(2/37V) = 1 one gets the canonical tricritical 
point (CTP) Kqtp = — In 3/ (2\/3) — —0.317. The first order line is obtained numerically by requiring that f((3, 0) = 
x*), where x* is the further local minimum of /. Figure [4"2l represents the phase diagram in both the canonical 
and the microcanonical ensembles. The features of this phase diagram are very close to those of the BEG model (see 
Fig. [M]) and of the generalized HMF (see Fig |23)) . We will comment them below, after having obtained the solution 
in the microcanonical ensemble |248|1 




-0.5 -0.4 -0.3 K -0.2 



FIG. 42: The canonical and microcanonical (K, T) phase diagram. In the canonical ensemble, the large K transition is 
continuous (bold solid line) down to the tricritical point CTP where it becomes first order (dashed line) . In the microcanonical 
ensemble the continuous transition coincides with the canonical one at large K (bold line). It persists at lower K (dotted line) 
down to the tricritical point MTP where it turns first order, with a branching of the transition line (solid lines). The shaded 
area is not accessible in the microcanonical ensemble. 

c. The solution in the microcanonical ensemble In the microcanonical ensemble, a simple counting method has 
been recently proposed [248| |. The magnetization M = YliLi can ^ e rewritten as M = N + — N_ by introducing 
the number of spins up, N+, and of spins down, 7V_. The hrst term of the Hamiltonian (|369|) can be straightfor- 
wardly rewritten as — M 2 /(2N). As two identical neighboring spins would not contribute to the second term of 
Hamiltonian (|369[) (SiSi+± — 1 being equal to zero) while two different ones would give a contribution equal to K, the 
total contribution of the second term is KU , where U is the number of "kinks" in the chain, i.e. links between two 
neighboring spins of opposite signs. 

For a chain of TV spins, the number of microstates corresponding to an energy E can be written as 

n(N + ,N-,U)^(?+)(^ ) . (378) 



{U/2J{ U/2 J 

The formula is derived by taking into account that we have to distribute N + spins among U/2 groups and 7V_ among 
the remaining U/2. Each of these distributions gives a binomial term, and, since they are independent, the total 
number of states is the product of the two binomials. The expression is not exact because we are on a ring, but 
corrections are however of order TV and do not affect the entropy. A slight correction to formula (|378|) is present for 
small N + , AL. and U/2, and all these numbers should be indeed reduced by a unity. 

Introducing m = M/N, u = U/N and e = E/N = — to 2 /2 + Ku, one thus finally gets the entropy 

s(e, m) = — lnf2 = — (1 + to) ln(l + m) + — (1 — m) ln(l — m) — u\sxu 

— — (1 + m — u) ln(l + m — u) — — (1 — m — u) ln(l — m — u) , (379) 

which is shown in Fig. [43] for different values of the energy e and for two values of the nearest-neighbor coupling K . 

In the large A^-limit, the last step is to maximize the entropy s(e, m) with respect to the magnetization m, leading 
to the final entropy s(e) = s(e, m*), where m* is the equilibrium value. As anticipated, one gets either a second or a 
first order phase transition depending on the value of the coupling constant K. As usual, and analogously to what has 
been done in the canonical ensemble, the expansion of s(e, to) in power of m is the appropriate procedure to define 
the critical lines and points. One gets here 

s(s,m) = s (s) + Am 2 + Bto 4 + C(to 4 ), (380) 
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FIG. 43: s(e,m) for different values of the energy. Left panel presents e = —0.1 (dashed line), e c ~ —0.15 (dotted), e = —0.2 
(solid) when K — —0.25: a second order phase transition. Right panel shows the case K = —0.4 when e = —0.25 (dotted), 
e — —0.305 (dash-dotted), e t = —0.3138 (solid), e = —0.32 (dash-triple dot), e = —0.33 (dashed): a first order phase transition. 
The gaps present in the lower dashed curve are related to ergodicity breaking (see text). Note that the different curves have 
been vertically shifted for readability purposes. 



with the paramagnetic zero magnetization entropy 

So(e) 

and the expansion coefficients 
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Using these expression, it is straightforward to find the second order phase transition line by requiring that A mc = 
(B mc < 0), finding f3 c = exp(— (3 C K) , which is the same equation found for the canonical ensemble. Again, as far as 
second order phase transitions are concerned, the two ensembles are equivalent. The tricritical point is obtained by 
the condition A mc = B mc = 0), which gives Kmtp — —0.359 and Pmtp — 2.21, which is close but definitely different 
from Kctp — —0.317 and (3ctp = V3- The microcanonical first order phase transition line is obtained numerically 
by equating the entropies of the ferromagnetic and paramagnetic phases. At a given transition energy, there are 
two temperatures, thus giving a it temperature jump as for the BEG and the generalized HMF model. Similarly to 
mean-field models, also this model exhibits a region of negative specific heat when the phase transition is first order 
in the canonical ensemble. 

d. Equ ilibrium dynamics: breaking of ergodicity Model (|369p exhibits breaking of ergodicity, as shown in 
Rcf. 248]. In order to reveal the dynamical consequences of th is effect, one has to define a microcanonical dy- 
namics. An appropriate one is given by the Creutz algorithm [l22l | , which probes the microstates of the system with 
energy lower or equal to the energy E. Indeed, there are two definitions of the microcanonical ensemble, that become 
equivalent in the thermodynamic limit. In the first one, only states contained on an energy shell [E, E + dE) are 
counted (this is the definition we used througho ut thi s review) ; in the second one all the phase-space volume contained 
within the energy E hypersurface is considered [l83| |. Creutz algorithm is based on this second definition. 

The algorithm is implemented by adding an auxiliary variable, called a demon, which has the following properties. 
One initiates the procedure with the demon at zero energy, while the system has an energy E. One then attempts to 
flip a spin. The move is accepted if it corresponds to an energy decrease and the excess energy is given to the demon. 
One then attempts to flip another spin. If the flip decreases the energy, it is accepted, while if it increases the total 
energy, it is accepted only if the needed energy can be withdrawn from the demon energy. Demon serves really as a 
bank with deposit and withdrawn: however, the total energy of the "bank" is always non negative (clearly different 
from typical banks in the 21st century!). 
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Creutz dynamics can be used to test the predictions on breaking of ergodicity discussed in Sec lIV EH Indeed, as it 
is already apparent in Fig. 1431 not all magnetization values are accessible in certain regions of the (K, e) plane, which 
manifests itself in the gaps of the entropy curves shown in Figl43l This property is in turn a consequence of the non 
convexity of the space of thermodynamic parameters (see Sec. Ill CP 

To demonstrate ergodicity breaking, we show in Fig. [44] the time evolution of the magnetization in two cases: in 
the first case, where the whole magnetization interval is accessible, one clearly see switches between the metastable 
state m* = and two stable symmetric magnetized states. On the contrary, in the second example, the metastable 
state belongs to an interval which is disconnected from the stable one (see the inset for the corresponding entropy 
curve). The dynamics maintains the system either in the stable or in the metastable interval, depending only on the 
initial condition. The system is unable to jump to the other interval even when the latter is more stable. 

Similar results had been obtained using Hamiltonian dynamics for the generalized HMF model, see Fig. 




(MC sweeps)/1000 

FIG. 44: Time evolution of the magnetization for K = —0.4 (a) in the ergodic region (e = —0.318) and (b) in the non ergodic 
region (e = —0.325). Two different initial conditions are plotted simultaneously. The corresponding entropy curves are shown 
in the insets. 



2. XY model 

a. Introduction In this subsection we study a generalization of the HMF model, in which the Hamiltonian, besides 
the mean-field interaction, has a nearest neighbour interaction between rotators. The presence of such a term requires 
that we specify the properties of the lattice where the model is defined. As in subsection IVI A 1( we study a one- 
dimensional lattice with a ring geometry; the first property allows the use of the transfer integral technique, while the 
second property, equivalent to the introduction of periodic boundary conditions, is convenient for the calculations, 
but it is irrelevant in the thermodynamic limit. 

The model has been introduced in Ref. [7o| , and its Hamiltonian is given by 

N 2 N N 

Hn = E y + 2N .? [1 " cos {6i ~ 6j)] ~ K ? cos {9l+1 ~ e ^ > (384) 

i— 1 1 i—1 

with 9n+i = B\. The parameter K is the coupling constant of the nearest-neighbour interaction. For K = 0, the 
Hamiltonian reduces to the HMF model, that has a second order phase transition at T c — 0.5. The similarities of 
this model with the Ising model studied in the previous subsection are clear: there is a mean-field interaction and 
a nearest neighbour interaction. However, there is also a kinetic energy term, which allows to define a Hamiltonian 
dynamics. Also in this case the interesting properties are found for K < 0. Let us first locate the first order transition 
at T — 0. This is done, as for the Ising case, comparing the energy per particle of the fully magnetized m = 1 state 
with that of the staggered non magnetic m = state. The energy density of the former is e = —K, while that of the 
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latter is e = 1/2 + K . The magnetized state is therefore favoured for K > —1/4. Qualitatively, it is the same phase 
diagram as the one observed in Fig. |4"01 

b. Solutions in the canonical and microcanonical ensembles We adopt the min-max procedure, described in 
subsection IIV D"4l We remind that, in this procedure, once an expression for the canonical partition function has 
been obtained in the form of Eq. (] 1 55[) . not only can we compute the canonical free energy </>(/?) by the minimization 
defined in Eq. ()156|) . but we can also obtain the microcanonical entropy s(e) by Eq. (|160|) . 

Let us first derive the canonical partition function. As for the HMF model, we use the Hubbard-Stratonovich trans- 
formation. The model shares with the HMF model the invariance under global rotations, therefore the spontaneous 
magnetization is defined only in modulus, while there is degeneracy with respect to its direction. We can exploit this 
fact to simplify slightly the computation; namely, we assume from the beginning that the saddle point value of the 
variable X2 in Eq. (|130p is 0. The spontaneous magnetization is in the direction of the xi-axis, but this will not cause 
any loss of generality. We therefore obtain 
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(385) 



The integral over the #'s is performed applying the transfer integral method. For our one-dimensional geometry this 
is given by 



d9\ . . . d9pf exp 



N 
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i=l 



i=l 



(386) 



where Xji'J, c) is the j-th eigenvalue of the symmetric integral operator 



{TiP)(6) = / da exp 



-7(cos 9 + cos a) + a cos(9 — a) 



(387) 



In the thermodynamic limit, only the largest eigenvalue X max will contribute to the partition function. This eigenvalue 
can be computed numerically by a suitable discretization of the integral operator (|387j) . The rescaled free energy is 



</>(P)=/3f(P) = w£$(p,x), 



(388) 



with 



<K/3, x) = ~ In j + |(1 + x 2 ) - \ max (f3x, pK) , 



(389) 



where we have not explicitly written the K dependence of <J)(f3,x) and <p{0). 

Accordingly to the min-max procedure, the microcanonical entropy s(e) is given by 

s(e) = sup s(e, x) , 



(390) 



with 



s(e, x) 



inf 



[3e-<t>(p,x) 



inf 





/3s + ^]nj-^(l + x 2 ) + In X max (/3x, (3K) 



(391) 



Again the dependence of s(e,x) and s(e) on K has not been explicitly written. The expressions (|389p and (|39ip arc 
used to study canonical and microcanonical thermodynamic phase diagrams, respectively, and then to check if there 
is ensemble inequivalence. As for the Ising case, to obtain the critical lines on the (T, K) plane, one has to expand 
(j)(P,x) and s(e,x) in powers of x. The power expansion of \ max {Px, PK) can be obtained explicitly as a function of 
modified Bessel functions. We will avoid here the details of the calculation leading to the final result. 

Taking into account that our system is invariant under the symmetry 9 — > —9, the expansion of <p(P,x) will have 
only even powers. Let us write explicitly the K dependence of the expansion coefficients 



4>(P, x) = 0o (ft K) + MP, K) x 2 + <f> 2 (p, K) x 4 + o(x 6 ) . 



(392) 
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The canonical second order transition line in the (T, K) plane is determined by 

01 (A K ) = 0, with 4> 2 (ft A") > . (393) 
Inserting this expansion in the first equality in Eq. (|391[) and minimizing with respect to /3, one obtains 

e = <#,(/?, A) + #(0, A) + 0' 2 (/3, A) x 4 + o(x 6 ) , (394) 
where the prime denotes derivation with respect to (3. This equation gives an expansion of the form 

/?(£, A, x) = ft(e, A) + ft(e, A) a; 2 + (3 2 (e, A) x 4 + o(x 6 ) . (395) 
Using this expression, we obtain the expansion of the form 

s(e, m) = s (e, A') + si(e, K)m 2 + s 2 (e, A)m 4 + o(ra 6 ) , (396) 

where 

s (e,A) = ft(e,A) £ -0 o (ft(e,A)) ( 397 ) 
si(e,A) = p 1 (e,K)e-<&(p (E,K))p 1 (e,K)-<t> 1 (p (e,K)) (398) 

s 2 ( £ ,A) = ft(e,lOe-^ , (A)(e,ifM(e,^-^o(A)(e.-K'))A(e ) -K') 

-0'i(ft(e, ^))/?i(e, *Q - 2 (A)(e, A')). (399) 
The microcanonical second order transition line is determined by 

st(e,K)=0, with s 2 (J3,K) < 0. (400) 
Now we can use the following equalities, that are obtained by inserting back (|395[) into (|394l) 

JO) = £ (401) 
<l>'i(0o(£, K)) = -<%(po(e,K))pi(E,K). (402) 



Equation (|401[1 implies that (3q(s, K) is the temperature on the second order transition line. One obtains 

x 4 + o(x 6 ) . (403) 



s(e,x) = f3 Q (e,K)e-MM^K))~MPo(e,K))x 2 
±<ft (Aj (e, A) ) /3 2 (e, A) + 2 (ft (e, A) ) 



Therefore, the microcanonical second order transition line is determined by 

MPo&K)) =0, with i^(/3 o ( £ ,A))/3 2 ( e ,A)-0 2 (/3o(e,A)) < 0. (404) 

The concavity of 4>(f3) implies that, again on the critical line, 0g(/3o(e, A)) < (This is valid in models where parity 
with respect to x implies that <j)p x = for m = 0). Comparing Eqs. (|393|) and (|404p , and taking into account the 
last observation, we see that, as we expected from the general discussion about ensemble equivalence at the end of 
subsection IIV A 31 the points belonging to the canonical second order transition line also belong to the microcanonical 
one. The microcanonical line, however, goes beyond and includes more points, as we have already seen in several 
models, since by continuity the inequality in (|404|> is satisfied also for some points where 02 (ft (s, K)) < 0. 

The study of the canonical and microcanonical critical lines involves the analytic de term ination of X max , which, in 
the limit x — > 0, can be performed using perturbation theory for Hermitian operators |228| . For both ensembles, one 
gets 

h(J3K) 4-/3 

hW) = WTl ' (405) 

where Iq and Ji are modified Bessel functions. The canonical tricritical point turns out to be Kqtp — —1-705 and 
Tqtp — 0.267, while microcanonical tricritical point is at Kmtp — —0.182 and Tmtp — 0.234. In Fig. [45l we plot 
the relevant part of the phase diagram on the (A, T) plane. As commented above, it is shown how the microcanonical 
critical line (ending at the microcanonical tricritical point MTP) extends beyond the canonical critical line, which 
ends at the canonical tricritical point CTP. 
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FIG. 45: Canonical and microcanonical (K, T) phase diagram of the one-dimensional XY model with mean-field and nearest- 
neighbour interactions. The canonical critical line (bold solid line) ends at the tricritical point CTP indicated by a triangle; then 
the transition becomes first order (dashed line). The microcanonical second order transition line coincides with the canonical 
one at large K (bold solid line) . It continues at lower K (light solid line) down to the tricritical point MTP, indicated by a filled 
circle; then the transition becomes first order, with a branching of the transition line (dotted lines), giving the two extremes of 
the temperature jump. 




c. Ergodicity breaking We here prove that model (|384|) shows ergodicity breaking, similarly to the Ising long- 
plus short-range model. Let us first study separately the bounds of the short-range term 

1 N 

-^008(0;+!-^) . (406) 

i=l 

The study of maxima and minima of this term can be reduced to those configurations where the spins are aligned 
parallel or antiparallcl to a given direction. There is no loss of generality, due to rotational symmetry, to then choose 
this direction as the x-axis. Therefore, only the x component of to is non zero and in the following we will restrict 
to m = m x > 0. It turns out that, in the thermodynamic limit, both extrema are attained with a fraction (1 + to)/2 
of rotators parallel to the x-axis and a fraction (1 — m)/2 antiparallcl to the x-axis, for all given values of to. The 
maximum is 1 and is achieved when the parallel rotators and the antiparallcl rotators are grouped in two separated 
blocks. The minimum 2m — 1, is instead attained when the antiparallcl rotators are all isolated, which is possible 
since we are considering to > 0. 

Now wc have to distinguish the two cases K > and K < 0. In the positive case, the minimum of this contribution 
to the energy density is —K (thus actually independent of to), while in the negative case it is —K(2m — 1). We obtain 
the minimum of the total potential energy per particle just by adding (1 — m 2 )/2. This is actually the minimum of 
the total energy per particle e, since the kinetic energy is positive definite. We finally obtain that the minimum of 
the energy per particle is 

4fn°M = - K (407) 

for K > 0, and 

<SM = ^(1 - ™ 2 ) - K ( 2m - 1) (408) 

for K < 0. Plotting the two functions £^^° and £^^° we will find the accessible to values for a given energy 
e. The functions are plotted in Fig. [46] for two representative values: K = 0.2 and K = —0.2. For positive K, 
we see that below the energy —K + 1/2 the system cannot have all magnetizations, and values of to near are 
inaccessible. We note however that the region of accessible values of to, for any energy, is connected; therefore there 
is no ergodicity breaking. Nevertheless, we remark that the accessible region in the (e, m) plane is not convex, as 
it can happen only for long-range interactions. For negative K, we note instead, that for a given energy interval 
(K + 1/2 < e < 2K 2 + K + 1/2 for K > -1/4 and -K < e < 2K 2 + K + 1/2 for K < -1/4), the attainable values 
of to are separated in two disconnected regions. Therefore for K < we have breaking of ergodicity. 
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FIG. 46: Accessible regions in the (e, m) plane for the one- dimensional XY model with both mean-field and nearest neighbour 
interactions. Solid curves represent a) e m ^j°(77i) (|407[) and b) £min( m ) l|408|) . The accessible and inaccessible regions in 
parameter space are indicated, a) K — 0.2: no breaking of ergodicity is present since at fixed energy e the accessible interval of 
m is connected (dotted line); b) K = —0.2: there is breaking of ergodicity for values of e between 0.30 and 0.38. For instance 
for e = 0.35 the are two disconnected intervals of accessible magnetizations (dotted lines). 



B. Weakly decaying interactions 

All the models that have been considered in previous sections have an infinite range term in the Hamiltonian. 
This undoubtly limits the applicability of our analysis to realistic physical systems, where interactions decay with 
the distance. We present below four models of this kind which are direct generalizations of models presented above 
and represent a step forward in the study of long-range weakly decaying interactions. Some of the models are exactly 
solvable and reproduce features of the phenomenology observed for infinite range systems (ensemble inequivalence, 
negative specific heat, ergodicity breaking, etc.). 



1. ot-Ising model 
Let us consider the one-dimensional a-Ising Hamiltonian 

H N = -L- y \ gfa , (409) 
N i- a \i-j\ a K 1 

i>j=l 1 J] 

where J > and spins Si — ±1 sit on a one-dimensional lattice with either free or periodic boundary conditions (in 
the latter case, \i — j\ is the minimal distance along the ring). The A rQ_1 prefactor is intro duced in order to have an 
extensive energy as explained in Sec. Ill Al This model has been first introduced by Dyson |l38l | and studied for the 
"integrable" case, a > 1, in the canonical ensemble without the N 01-1 prefactor. We show here that it is possible 
to obtain an exact microcanonical solution using large deviation theory when < a < 1 [H, H3, . Moreover, the 
study of this model gives also the opportunity to emphasize the important role played by boundary conditions when 
the interactions are long-range. 

We adopt the same scheme described in section IIV El to obtain the solution of model (|409|) . The method can be 
generalized to lattices of higher dimension. 

In the first step, the Hamiltonian Hn is rewritten in terms of global variables by introducing a coarse- graining. Let 
us divide the lattice in K boxes, each with n = N/K sites, and let us introduce the average magnetization in each 
box mk, k = 1...K. In the limit N — > oo, K — > oo, K/N — > 0, the magnetization becomes a continuous function 
m(x) of the [0, 1] interval. After a long but straightforward calculation, described in Ref. [37|, it is possible to express 
i/jv as a functional of m(x) 



H N = NH[m{x)]+o(N), 



(410) 
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where 

H[m(x)] = i tdx [\y l - m[X)m{v) . (411) 

1 1 J 2 J J \x - y\<* V 1 

The estimation is uniform on all configurations. 

In the second step, we evaluate the probability to get a given magnetization m k in the fc-th box from all a 

priori equiprobablc microscopic configurations. This probability obeys a local large deviation principle P(mfc) oc 
exp[ns(mfc)], with 

_/ \ l+m k l + m k 1-TOfc. l-m fc ,. 10 , 

*("**) = 2 — 2 2 — 2 — ■ ^ ' 

Since the microscopic random variables in the different boxes are independent and no global constraints has been 
imposed, the probability of the full global variable (mi, . . . , mx) can be expressed in a factorized form as 

K K 

P( mi ,m 2 ,...,m K ) = n p K)-II eri5(mi) ( 413 ) 

i=l 

s{m,i) 



exp 



i=l 

K 



(414) 



~ e^>Wl (415) 

where s[m(a;)] = f s(m{x))dx is the entropy functional associated to the global variable m(x). Large deviation 
techniques rigorously justify these calculations (57j . proving that entropy is proportional to TV, also in the presence of 
long-range interactions. This result is independent of the specific model considered. 

In the third step, we formulate the variational problem in the microcanonical ensemble to get the entropy 

s(e) = sup (s[m(x)] \e = H[m(x)] ) . (416) 

m(x) 

Let us remark that this optimization problem has to be solved in a functional space. In general, this has to be done 
numerically, taking into account boundary conditions. In the case of free boundary conditions, the only available 
solutions are numerical. An example of a maximal entropy magnetization profile obtained for free boundary conditions 
is shown in Fig. [?7] for different values of a. The profile become more inhomogeneous when increasing a (for a = 
one recovers the mean- field result with a homogeneous profile). 

In the following, we will treat the periodic boundary conditions case, for which analytical result can be obtained. 
Both entropy and free energy can be obtained in analytical form for homogeneous magnetization profiles, which have 
been shown to be locally stable in both the high-temperature and the low- temperature phase [37| . It has indeed been 
proven that for j3 < /3 C = (1 — a)/(J2 a ) there is a unique global maximum of s(e), corresponding to a constant zero 
magnetization profile. The variational problem (|416[) . where s is defined in Eqs. (|412[) . (|414p and (|415[) . leads to the 
consistency equation 

tanh" 1 (mix)) = J [ dy, (417) 

Jo \x-y\ a 

where (3 is a Lagrange multiplier. For /3 > (3 C , we restrict ourselves to constant magnetization profiles, which are 
locally stable, i.e. close non constant profiles have a smaller entropy. In this case, using the relations J dx\x — y\~ a = 
2 a /(l — a) and £ max = l/(2/3 c ) one can obtain the magnetization vs. energy curve m = ± \J1 — e/e max , see Fig. l48f a). 
Moreover, fixing the energy implies fixing the magnetization and, consequently, the Lagrange multiplier (3 in Eq. ()417|) . 
Expressing the magnetization in terms of the energy in the entropy formula (|416[) allows us to derive the caloric curve, 
see Fig. l48lb) (solid line). The limit temperature j3 c (dotted line) is attained at zero magnetization, which is a 
boundary point. 

In the canonical ensemble, one has to solve the variational problem (|195|) . This leads to exactly the same consistency 
equation (|417p . where the Lagrange multiplier is replaced by the inverse temperature {3. Solving this consistency 
equation for (3 > 0, one finds a zero magnetization for (3 < (3 C and a non vanishing one for (3 > (3 C . One can also derive 
the canonical caloric curve, which is reported in Fig. 1481( b) and superposes to the microcanonical caloric curve from 
(3 = oo down to (3 C while it is represented by the dashed line for (3 < (3 C . It follows that in the region [0, /? c ], the two 
ensembles are not equivalent. In this case, a single microcanonical state at £ max corresponds to many canonical states 
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FIG. 47: Equilibrium magnetization profile for the a-Ising model with free boundary conditions at an energy density e = 0.1 
for a = 0.2 (solid line), a = 0.5 (dotted line) and a — 0.8 (dashed line). 




FIG. 48: a) Equilibrium magnetization in the allowed energy range in the microcanonical ensemble for the a-Ising model with 
a — 0.5; the negative branch is also reported with a dotted line, b) Inverse temperature versus energy in the microcanonical 
ensemble (solid line). The canonical ensemble result superposes to the microcanonical one in the interval [/3 c ,oo) and is 
represented by a dashed line for [3 £ [0,/3 c ]. /3 C is then the inverse critical temperature in the canonical ensemble. In the 
microcanonical ensemble, no phase transition is present. 



with canonical inverse temperatures in the range [0,/3 c [. Thus, in such a case, the canonical inverse temperature is 
not equal to the microcanonical one. In the microcanonical ensemble, the full high temperature region is absent and, 
therefore, no phase transition is present or, in other terms, the phase transition is at the boundary of the accessible 
energy values. The entropy is always concave, hence no inequivalence can be pre s ent in the allowed energy range, 
apart from the boundaries. This situation is often called partial equivalence [83l . ll4lll300l |. Partial equivalence persists 
for all a values below one, and is removed only for a = 1 when e max — > oo and (3 C — > 0: the phase transition is not 
present in both ensembles and the system is always in its magnetized phase. The main drawback of this analysis is 
the difficulty to obtain analytical solutions of Eq. (|417p for non constant magnetization profiles, which is the typical 
situation when boundary conditions are not periodic. 
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2. a-HMF model 



Let us consider now a generalization of the HMF model ()126[) which has been originally proposed in Ref. [1, |293{ . 
The interaction between two rotators has the same form as in the original model, but the coupling constant is a weakly 
decaying function of the distance between the two lattice sites where the rotators sit. Note that we consider the lattice 
in general dimension d. In Ref. [75j], the generalized case of n- vector spin models has been analyzed, including the 
HMF model (for n = 2) and the Ising spins (for n = 1). Only periodic boundary conditions have been considered, as 
we will do here. 



The Hamiltonian of the a-HMF model is 



N 2 1 N 



g W = V? + — E l - cos ^-^ , (418) 

where is the minimal distance between lattice sites i and j (minimal distance convention in periodic boundary 
conditions). We consider here cases for which the exponent a, determining the decay with distance of the coupling 
constant between rotators, is less than the spatial dimension d, so that the system is long-range as shown in Sec. IIIBI 
The TV prcfactor is introduced in order to have an extensive energy as explained in Sec. Ill Al We show below that, 
in any spatial dimension d and for each a < d, the thermodynamics of this model is the same as that of the HMF 
model; in particular, there is a second order phase transition at the inverse temperature (3 = 2. 

Let us introduce the matrix of the coupling constants = l /rj* . Since the diagonal elements of this matrix are 
not defined, we assign to them the common arbitrary value b. The prefactor is 

JV 



N = E J v ( 4l9 ) 



that does not depend on i, because we adopt the minimal distance convention for ry. It is clear that in the thermo- 
dynamic limit TV — > oo, the prefactor TV behaves as N x ~ a / d . The arbitrary parameter b, which appears only in TV, 
becomes negligible in thermodynamic limit. Its introduction is however useful for the computation, as it will become 
clear. 

The most relevant steps of the solution in the canonical ensemble (we refer the reader to Ref. [75[ for details) can 
be obtained starting from the partition function 

/ 9 \N/2 . ( B N 

Z(f3, TV) = ^— J J dfli . . . d6 N cxp I -^4- E J H I 1 " cos (^ " I 
2tt\ N/2 ( TV/T 

exp ( — ) / d6>i . . .dOpf cxp | y ] [cos^i?^ cos 6^ + sin^i?^ sin.9j] | , (421) 



f N 

/ ddi . . .d0N cxp E] [cos 9iRij cosdj + smdiRij sm9j] 



where we have introduced the matrix Rij = j3Jtj/(2N) and used the property J,j = TV TV. In order to use the 
Hubbard-Stratonovich transformation, one diagonalizes the expression in the second exponential. Denoting by R the 
symmetric matrix with elements Rij, and by V the unitary matrix that reduces R to its diagonal form D = VRV T , 
one gets 



N N 



E cos 6 l R i] cos Oj = E R ^ 2 i - ( 422 ) 



where the quantities Ri arc the real eigenvalues of the matrix R, and a, = X/jLi cos9j. An analogous expression 
holds for the sum with the sines in Eq. (|42ip . In order to express exp(iJjaf) using the Hubbard-Stratonovich trans- 
formation, the eigenvalues Ri must be positive. By choosing a sufficiently large value of 6, but in any case of order 1, 
all eigenvalues Ri are positive. One can therefore write 
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In this iV-dimensional Hubbard-Stratonovich transformation, we have used that det R = YiiLi we have introduced 
the unitary change of integration variables defined by yi = Y^.j—i Vij z j an d we have indicated with the inverse 
matrix of R. Using the analogous expression for the term with the sines, we arrive, after performing the integration 
over the angles 6, to the following expression for the partition function 



N 



i=l 



(425) 



This integral can be computed using the saddle point method, since most eigenvalues Ri vanish in the thermodynamic 
limit [75|]. Introducing pi = y/z? + z 12 , the stationary points are solutions of the following consistency equations 

±Y (R-^.-zj = W\~ (426) 
2 ^ V ,%3 3 lo (Pi) p l V ; 

which, after inversion with respect to z%, z[, become 



N 



2 E^fr4- ( 428 ) 



3=1 

N 



< - 2 E^#4-- ( 429 ) 

fri h {pj) pj 

Let us first look for homogeneous solutions, i.e., for solutions in Zi, z\ that do not depend on i. The previous system 
reduces to the following pair of equations for z = z% and z' = z[ 

* = 4j4 £ and = ( 43 °) 

where we have used that 2 X3?=i -^y = f° r each i. By expressing z = pcos7 and z' = psin7, we see that these two 
equations determine only the modulus p = V z 2 + z' 2 , leaving the angle 7 undetermined. They thus reduce to a single 
equation for the modulus 

= ( 4 3D 

This equation, after the change of variable p = (3m, coincides with Eq. (|134[) of the HMF model. 

In order for this homogeneous stationary point to be the relevant one for the evaluation of the integral in Eq. (|425j) . 
it is necessary that the exponential has an absolute maximum at this stationary point. One can easily show that the 
homogeneous stationary point is a local maximum (75| . Here, we will directly prove that it is also a global maximum 
with respect to any other possible inhomogeneous stationary point. 

Let us then go back to Eqs. (|428[) and (|429[) and suppose that they have inhomogeneous solution, considering as a 
first case a solution with different values for the moduli pi = \J zf + z' 2 . If k is the site with the largest modulus p k , 
let us choose the x-axis of the XY rotators such that Zk = Pk and z' k = 0. Then one can write 

Zk = Pk = 2 f; %fH- ^ 2 E ButH ( 432 ) 



< 



2 ^"M^-^)' (433) 
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where use has been made of the monotonicity of the function Ii/Iq. We emphasize that the last inequality is strict 
since, by hypothesis, not all moduli pi are equal. From the properties of the function Ii/Iq, it follows from the 
last inequality that also the largest modulus p^ of this inhomogeneous stationary point is smaller than that of the 
homogeneous solution. One gets the same conclusion for a possible stationary point with all equal moduli but with 
different directions; in that case inequality (|433[) becomes an equality, but inequality (|433[) is strict. The proof is 
completed by showing that the exponent in Eq. (|425|) . if computed at a stationary point, has a positive derivative 
with the respect the moduli pi of the stationary point. In fact, it can be easily shown that at a stationary point the 
exponent in Eq. (|425|) , let us call it A, can be written as a function of the moduli pi of the stationary point in the 
following way 



N 



A = J2 



2 I [Pi) 



(434) 



Its derivative with respect to pi is 



OA 
dpi 



h (Pi) , g h{ Pi ) 
h(pi) l dpiIo{pi) 



(435) 



The concavity of the function Ii/Iq, for positive values of the argument, assures that the right hand side is positive. 
This concludes the proof. 

We conclude that for each dimension d and for each decaying exponent a < d, the free energy of the q-HMF model 
with periodic boundary conditions is the same as that of the HMF model. Maybe the derivation of the solution in 
the microcanonical ensemble does not pose particular technical problems [3~i| . 



3. One- dimensional gravitational models 

A direct study of the full three-dimensional i V-bo dy gravitational dynamics is particularly heavy [l73j | and special 
purpose machines have been built to this aim |203| . Therefore, lower dimensional models have been introduced to 
describe gravitational systems with additional symmetries. For instance, the gravitational sheet mode l, describing the 
motion of infinite planar mass distr ibutions p erpendicularly to their surface, has been considered |l79| . Although this 
model shows inter estin g behaviors [210l . l302l |. the specific heat is always positive and no phase transition is present. 
Long ago, Henon [l75| | introduced a system of concentric spherical mass shells that interact via gravitational f orces . 
This model has been recently shown to display phase transitions and ensemble inequivalence in a mean- field limit [3191 ] . 

In this section we disc uss th e phase diagram, in both the microcanonical and the canonical ensemble, of the Self- 
Gravitating-Ring (SGR) [288| | model. In this model, particle motion is constrained on a ring and particles interact 
via a true 3D Newtonian potential. The Hamiltonian of the SGR model is 

N 



H " = Jl>i+^r£^-^): ( 436 ) 



2N 

i=i i,j 

with Vs(9 l -9j) = — -= — 1 (437) 

where 5 is the softening parameter, which is introduced, as usual, in order to avoid the divergence of the potential at 
short distances. Taking the large 5 limit, the potential becomes 



1 - cos (ft - 9j) 
25 



- 1 



+ 0(<T 2 ) , (438) 



which is the one of the Hamiltonian Mean-Field (HMF) model (| 1 26[) . Since the interaction is regularized at short 
distances, a global entropy maximum always exists, and thermodynamics is well defined in the mean-field limit. In a 
situation close to that of the HMF model e.g. for 6 = 10, the caloric curve determined from microcanonical numerical 
simulations is reported in Fig. l4"9l a). In the homogeneous phase e > £t(<5), the caloric curve is almost linear, while 
in the clustered phase s < £*(<$), it is bent downward. Nonetheless, temperature always grows with energy and 
one does no t ob serve any negative specific heat energy range. However, as it happens for 3D Newtonian gravity 
simulations [173| |. when one reduces the softening parameter, a negative specifi c hea t phase develops. For instance, in 
Fig. l4*9Tb). we show two cases at small 6 where three phases can be identified |288l |: 
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FIG. 49: Caloric curves of the Self Gravitating Ring (SGR) model obtained from numerical simulations of Hamiltonian (|436l) . 
Panel (a) refers to the softening parameter value S = 10, for which a second order phase transition appears at e t . No backbending 
of the caloric curve, indicating a negative specific heat, is present. Simulations were performed for N = 100. Panel (b) shows 
the caloric curves for two different values of the softening parameter, 6i = 1.0 x 10~ 6 and 82 = 2.5 x 10~ 7 , and N = 100. 
The transition is here first order in the microcanonical ensemble. A negative specific heat phase appears for e a < £ < £t, and 
expands as the softening parameter is reduced. 



• a low-energy clustered phase for e < e a , where e a is denned as the energy at which the negative specific heat 
region begins. 

• an intermediate-energy phase, e a (d) < e < £t{8), with negative specific heat. 

• a high-energy gaseous phase for £t(S) < e. 

The clustered phase is created by the presence of softening, without which the particles would fall into the zero 
distance singularity. In the gas phase, the particles are hardly affected by the potential and behave as almost free 
particles. The intermediate phase is expected to show the characters of gravity, persisting and even widening in the 
6 -»• limit. 

It is shown analytically in Ref. |296| that ensembles are not equivalent and a phase of negative specific heat and 
temperature jumps in the microcanonical ensemble appears in a wide intermediate energy region, if the softening 
parameter is small enough. In Fig. [5Ql that is drawn for 6 = 10~ 5 , wc show all the features that we had already 
commented in the context of the study of the phase transition of the BEG model (jIV Bp . In particular, we would like 
to draw the attention of the reader to the similarities of Fig. ISTJb with Fig [T^b . 

The phase transition changes from second to first order at a tricritical point, whose location is not the same in 
the two ensembles. In Fig. [51] we represent the (5-dependence of the critical energy e t and the so-called coexistence 
region. The transition line separating the low energy and high energy phases has a tricritical point: phase transitions 
are first order at small <5 and second order at large S. 



4- Dipolar interactions in a ferromagnet 



In Sec. IIII1 where we have discussed several physical examples of long-range interacting systems, we have remarked 
that dipolar interactions are marginal as far as the long-range character of the interaction is concerned. Indeed, in 
this case, the exponent governing the decay of the interaction potential is a = d = 3. In spite of this, we will show 
here that magnetic dipolar interactions offer a relatively simple way to observe a direct manifestation of long-range 
interactions in samples of suitable shape. 

Generally, Heisenberg exchange interaction between electronic spins is much higher (usually a few order of magni- 
tudes) than magnetic dipolar energy, and therefore the only role played by the dipolar interaction is the introduction 
of anisotropics of the spontaneous magnetization. On the other hand, magnetic ordering in n uclea r spins, governed by 
dipolar interaction alone, would require experiments performed at nanokelvin temperatures [256l | . Here, we describe 
an example in which the dipolar interaction between electronic spins acquires, through the careful preparation of the 
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FIG. 50: Temperature (panel (a)) and entropy (panel (b)) versus energy e for the softening parameter value 5 = 10 -5 . Four 
values of the energy, indicated by the short-dashed vertical lines, can be identified from this picture: ei ~ —93 and £2 — 6 
bound from below and above the region of inequivalence of ensembles. et — is the first order phase transition energy in the 
microcanonical ensemble. e a ~ —66 limits from below the negative specific heat region, (j^ 1 ~ 15, represented with a dashed 
line in panel (a), is the canonical transition temperature. The solid lines represent the analytical solutions of temperature and 
entropy in the high energy phase. They are extended slightly below e t , in the metastable phase. The insets in panels (a) and 
(b) show a zoom of temperature and entropy around et, revealing a temperature jump. 



sample, an effective strength in a suitable range of temperatures, such that its long-range character gives rise to an 
interesting dynamical effect related to a phase transition [77} . 

Compounds of the type (C^^y+iNHa^CuCLj have been considered in Refs. [283l . 12841 . |313| |. where the interest 
was the observation of intrinsic localized modes (discrete breathers). These compounds, organized in a face centered 
orthorombic crystal, are layered spin structures, in which the weak magnetic interlayer interaction is antiferromagnetic 
for v > 1 and ferromagnetic for u = 1. The relevant variables are the s = 1/2 spins of the Cu 2+ ions, that are placed 
in two-dimensional layers. The hard axis of the magnetic interaction is orthogonal to the layers, that therefore are the 
easy planes. However, also the in-plane interaction is anisotropic: an easy and a "second easy" axis can be determined. 
The cited works focused on the antiferromagnetic case v = 2. Here, we are interested in the ferromagnetic case v = 1, 
i.e. to the compound (CiHsNHa^CuCLt, called bis(Methylammonium) tetrachloro-copper [9l| . 

When writing the Hamiltonian of the system, we adopt a coordinate frame in which the hard axis is the x-axis, while 
the z-axis and the y-axis are the easy and "second easy" axis belonging to the two-dimensional layers, respectively. 
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FIG. 51: Phase diagram for the Self-Gravitating Ring model, see Eqs. (|436fl and (|437|) . The dash-dotted line represents the 
upper energy of the metastable low energy phase, the dashed line is instead the lower energy of the metastable high energy 
phase. They bound the region of phase coexistence. The dotted line joining the points is the phase transition line, which 
changes order at the microcanonical tricritical point (MTP). The filled (open) circles belong to the first (second) order phase 
transition line. 



The Hamiltonian reads therefore 



H = -W ]T 

I,<i,j> 



v v 



Ij 



■ E 

I,<i,j> 



S IV S 



E 



2/4 



s Ii 



- 3 



(~s*ii ■ ■ ~r) 



(439) 

where the first, second and third sum represents the intralayer exchange interaction, the interlayer exchange interaction 
and the dipolar interaction, respectively. The capital indices number the layers, while the lowercase indices denote 
the spins. In the first sum < i, j > refers to nearest-neighbors within the same layer and in the second sum to 
nearest- neighbors in adjacent layers; the last sum is extended over all pairs of spins of th e system. The intralayer 
exchange constant W is much larger than the interlayer one w, for example W ~ 10 4 u> [l37j |. The parameters £ and r), 
both smaller than 1 are related to the out-of-plane and in-plane anisotropics. In the dipolar interaction, /is is Bohr's 
magneton and r is the modulus of r , the vector between the sites of the spins s u and s jj. 

The large W/w ratio determines the existence, for a given shape of the sample, of a temperature range in which all 
the spins of a single layer are ferromagnetically ordered, such that the spin vector s n can be considered as independent 
of the index i, ~sj = ~s H, while full three-dimensional ordering has not yet been reached. This situation arises if the 
temperature is below the single layer ordering temperature, of the order of W, and if in addition the number n of 
spins in each single layer is such that nw < W . Under such conditions the thermodynamic and dynamical properties 
of the system will be determined by the interlayer exchange constant w and by the long-range dipolar interaction. 
Exploiting the fact that all spins in a layer are ordered, the Hamiltonian can be cast in a one-dimensional form. 
To this purpose, one has to use a procedure employed in the treatment of dipolar forces, in which the short-range 
contributio n (h ere including the interaction between nearest-neighbors) and the long-range contribution are treated 
separately [216| . The latter contribution gives rise to shape-dependent terms. Here, we consider a rod-shaped sample, 
with short sides along the out-of-plane x-axis and the in-plane y-axis, and long sides along the in-plane z-axis. With 
this shape, the long-ra nge dipolar interaction produces an interaction term in the Hamiltonian which is typical of 
an ellipsoidal shape [2I, |216| |. The magnetization in the sample varies only alon g the £-axis. If we denote by TV the 
number of layers of the sample, the Hamiltonian is then transformed to 0, Il47l | 
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(440) 



The first two sums come from the first sum in Eq. (|439|) . and describe the intralayer exchange interaction. Here 
B x = 4W(1 — £) and B y = 4W(1 — r;). Since for this compound the out-of-plane anisotropy is much larger than 
the in-plane one, we have that B x 3> B y ; a constant additive term involving W has been neglected. The third sum 
in Eq. (|440[) is a combination of the interlayer exchange interaction (second sum in (|439[) ) and the nearest-neighbor 
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interlayer dipolar interaction, while the intralaycr nearest-neighbor dipolar interaction produces a constant term that 
has been neglected. In this sum, u ex is given by 

^=2„-«(2-||). (441) 

where r a and rt> are the distances between nearest-neighbor spins within the same layers and adjacent layers, respec- 
tively. The last two mean-field terms in Eq. (|440p come from the long-range part of the dipolar interaction. This 
contribution is proportional to a combination of the square of the components of the magnetization density vector, 
with the coefficients of the combination depending on sample shape. In this case, where the magnetization varies 
along the x-axis, one has to consider the average magnetization density vector Q. In these mean-field terms, ujm is 
given by u>m = (47r/3)(2^/«o), where i>o is the volume of the unit cell of the lattice. In the effective Hamiltonian 
(|440p . we have neglected all terms including the x component of the spins which emerge from dipolar or exchange 
interlayer forces, since B x is much larger than B y , wm and cu ex . 

Hamiltonian (|440[) governs the dynamics of the vector spin s j through the torque equation 

h ^T = ~?J x ™J > ( 442 ) 
at 

where Hj = —dHw/dltj is an effective magnetic field acting on the spin ~sj. The numerical results that we are 
going to present shortly are based on the integration of the last equation. To make a connection with the analytical 
results, we introduce an approximation in the Hamiltonian. We define S j = 2~sj, S V j = yl — (S x j) 2 sm6j and 
S z j = yl — (S x j) 2 cos 6*j, which yields the equations of motion in terms of the angular variables 6j and Sj. Noting 
that B x is much larger than all the other parameters, which implies that y/l — (Sj) 2 f» 1, the equations of motion 
are simplified as follows 

h^- = -nB x S x , . (443) 

This dynamics is consistent with the following effective Hamiltonian, which includes only the angles fj's, and their 
time derivatives 

n-i f> N i / N \ 2 i / N \ 2 

(444) 

where we have introduced a dimensionless time via the transformation t — > tny/B x u>m /K. We note the similarity of 
this Hamiltonian with the Hamiltonian (|384|) of the chain of rotators coupled by a nearest-neighbor interaction and 
a mean-field one studied in subsection IVI A 2l With respect to Hamiltonian (|384[) . the one given in Eq. (|444[) is not 
invariant under global rotations, because of the presence of the term proportional to B y and of the difference between 
the coefficients of the two mean- field terms. However, if B y > 0, which is the case experimentally, these differences 
do not change qualitatively |228l | the phase diagram obtained in subsection IVI A 21 

Let us now discuss the dynamical effect that could in principle be observed in experiments with this type of samples. 
We have shown that the XY model with nearest-neighbor and mean-field interactions has, in particular, a second 
order ferromagnetic phase transition for all positive values of the nearest-neighbor coupling. We have also shown that, 
below the critical energy, there is another energy threshold, below which ergodicity is broken, with an inaccessible 
range of values of the magnetization around (contrary to the case K > of subsection IVI A 21 the inaccessibility 
of a magnetization range around has to be considered a breaking of ergodicity, since now the lack of invariance 
under global rotation can forbid the passage from m x > to m x < 0). For the particular values of the parameters 
in Hamiltonian (|444[) . one finds [77j that the critical energy is e c ~ 0.376, and the threshold below which there is 
breaking of ergodicity is e = —0.3. Besides that, because of the lack of global rotational invariance, the spontaneous 
magnetization has only two possible directions, i.e., 9j = and 9j = it. If we consider the dynamics of such a 
system, at an energy between the threshold for ergodicity breaking and the critical energy, and with a finite number 
of degrees of freedom, we should therefore observe that the modulus of the magnetization will fluctuate around that of 
the spontaneous magnetization, and from time to time will flip between the two directions of the magnetization. We 
expect that the flips will be more and more rare as the system size increases and approaching the ergodicity breaking 
threshold [13] ■ The sudden flips are due to the fact that, contrary to short-range systems, the formation of domains 
with different directions of the magnetization is not possible. 

Numerical experiments have been performed using Hamiltonian (|440[) and the torque equation (|442p fn\ . We 
here present results concerning the two energies e = 0.2 and e = 0.4 (in the adimensional values computed after 
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the transformation to Hamiltonian (|444[) ). The first value is below the critical energy but still far enough from the 
crgodicity breaking threshold to observe magnetization flips on a reasonable time scale. The second value is above 
the critical energy, and the system should fluctuate around zero magnetization. In Figs. [52j we show the results of 
the numerical simulations that confirm the expectations. 

From the numerical data, it is also possible to obtain the probability distribution function for the magnetization, 
shown in the insets of Figs. [52] by the open circles. The logarithm of the probability distributions obtained numerically 
is compared with that computed analytically at the corresponding energy values on the basis of Hamiltonian (|444p . The 
comparison is then made between the numerical distribution functions obtained by direct simulations of Hamiltonian 
(|440p and the expression s(e,x) given by Eq. (|39ip . obtained for Hamiltonian (|444[) . The agreement is good although 
not perfect. The effect which leads to disagreement is twofold. First, one should have obtained the expression of 
s(e,a;) for Hamiltonian (|440p . which is in principle possible. Second, we have taken x as playing the role of the 
spontaneous magnetization m. However, we have already emphasized several times that the parameter x coming 
from the Hubbard-Stratonovich transformation coincides with the spontaneous magnetization only at equilibrium. 
The reasonably good agreement that we obtain justifies a posteriori this identification, which is valid only close to 
equilibrium. 

In summary, our main purpose here was to see that a system which is realizable in the laboratory can be used to 
observe the peculiarities predicted for long-range interacting systems. We have emphasized the importance of sample 
shape. For instance: for spherical or approximately spherical, samples, spin flips will not be observed, since the 
mean-field terms in the Hamiltonian will not introduce any anisotropy. 
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FIG. 52: Numerical simulations of the one dimensional spin chain model, Eqs. (|440|) and (|442[) . with two different energies: 
e = 0.2 (left panel) and e = 0.4 (right panel), i.e. below and above the second order phase transition energy e c ~ 0.376 
predicted for Hamiltonian (|444|l . The number of layers in the simulations is N = 100. The insets show the corresponding 
entropy curves (solid lines) obtained for Hamiltonian (|444|l and the data obtained from the probability distribution function of 
the magnetization (open circles) for Hamiltonian (|440p . 



VII. CONCLUSIONS 



A. Summary 

It should be now clear that the dynamics and thermodynamics of systems with long-range interactions is a rich 
and fascinating field. It has been shown how, changing the range of the interactions, deeply renew several aspects of 
statistical mechanics. We have encountered many unusual properties: ensemble inequivalence, negative specific heat 
and negative susceptibility, ergodicity breaking, quasi-stationary states, algebraic relaxation and anomalous diffusion. 
Rather than being contradictory with standard statistical mechanics, they extend its domain of validity. It should 
be remarked that the analysis of statistical ensembles for long-range systems privileges the use of microcanonical 
ensemble, since all canonical macrostates can be realized in the microcanonical ensemble, but the converse is not true: 
this is the essence of ensemble inequivalence. These properties are of course not found in all models and for any value 
of the thermodynamic parameters. Table [J summarizes the main features of the models studied in this review. 

We have mostly restricted our analysis to mean-field models. However, we have shown that the properties that we 
have discussed extend also to non mean-field interactions, as presented in Sec. IVII Recently, new analytical methods 
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Model 


Variable 


Ensemble 
Inequivalence 


Negative 
specific heat 


Negative 
susceptibility 


Ergodicity 
Breaking 


Computable 
Entropy 


Section 


BEG 


D 


Y 


Y 




Y 


Y 


IIVBI 


3 states Potts 


D 


Y 


Y 


N 


N 


Y 


IIVE2I 


Ising L+S 


D 


Y 


Y 




Y 


Y 


IVIAll 


a-Ising 


D 


Y 


N 






Y 


IVIB1I 


HMF 


C 


N 


N 






Y 


I1VDI 


XY L+S 


C 


Y 


Y 




Y 


Y 


IVIA2I 


q-HMF 


C 


N 


N 








IVIB2I 


Generalized HMF 


C 


Y 


Y 




Y 


Y 


IIVE4I 


Mean-Field 4 


C 


Y 


N 


Y 


Y 


Y 


IIVE5I 


Colson-Bonifacio 


C 


N 


N 


N 


N 


Y 


IIVE6I 


SGR 


C 


Y 


Y 


N 


Y 


Y 


IVIB3I 



TABLE I: Summary of the important properties of the different models studied in this review. C for continuous and D for 
discrete. Y for Yes, N for No, while the slot is left empty when the answer is not known. L+S means long-range plus short-range 
interactions. 



have been developed to solve models with long-range interactions at equilibrium: they all aim at the direct derivation 
of the entropy. We have in particular presented in full detail the large deviation method in Sec. IIVEI Relaxation 
to equilibrium is a key topic in systems with long-range interactions, which historically began with the realization 
that self-gravitating systems cannot converge to statistical equilibrium. By considering a much simpler model, that 
nevertheless is representative of long-range interacting systems, we have shown that relaxation to equilibrium proceeds 
in two steps. On a fast time scale, one observes relaxation to quasi-stationary states that are successfully described 
using a theoretical approach proposed by Lyndcn-Bcll. On a longer time scale, one indeed observes relaxation 
to Boltzmann-Gibbs equilibrium. In connection with this complex relaxation to equilibrium, we have extensively 
discussed kinetic equations for systems with long-range interactions (Klimontovich, Vlasov, Lcnard-Balcscu, Fokkcr- 
Planck). All this is presented in Sec. fVl 



B. Open problems 

Before closing this review, we would like to briefly comment on a few topics that we have discussed but which would 
need a wider treatment since it is clear that the present review opens a "Pandora box" . 

Another source of negative specific heat which has not discussed at all in this review is thermodynamic instability. 
Even if a system is short-range, attractive and the interaction is regularized at short distances, energy density can be 
unbounded from below and both potential energy and kinetic energy will increase as a conse quence of the formation 
of a cluster. This phenomenon has been discussed in a s eries of papers beginning with Refs. 121 , 266j |. 



For two-dimensional and geophysical flows, Chavanis [107l |. as well as Bouchet and Venaille [3081 ] . obtain different 
equilibrium states depending on the shape of the domain. Following a classification of phase transitions previously 
derived by Barre and Bouchet [6pj ] . they exhibit the first example of bicritical points and second order azeotropy 
for long-range interacting systems. There is a close connection between two-dimensional fluid dynamics and two- 
dimensional elect ron plasmas. In this latter context, experiments have shown the relaxation tow ards different quasi 
stationary states [I54ill84 , l208{ . and recently theoretical frameworks have been proposed [6^, [202]. Electron plasmas 
could also constitute experimental setups to observe in the future the new type of phase transitions predicted in 
Ref. [13]. 

While the initial violent relaxation has been successfully characterized, the slow collisional process that leads to the 
final equilibrium is still an open field of research. Chavanis has carefully investigated this problem and has proposed 
a rich kinetic theory app roach [95[ . As an application, collisional relaxation has been numerically investigated for the 
HMF model in Ref. [Hf- 

Concerning self-gravitating systems, one should quote the remarkable observation of a sequence of transient stat es in 
the long-term stellar dynamical evolution. Density distributions of such states are well fitted by stellar polytropes |294l . 



12951 ] . Similar observations have been reported for the HMF model in Ref. [71( . Recently, the problem of the robustness 
of quasi-stationary states with respect to noise has been investigated [23| . Gravitation al clu stering in finite and infinite 
systems has been numerically investigated by Joyce, Sylos Labini and collaborators |29l| . These authors have tried 
to assess if simulations performed with a finite number of particles reproduce the mean-field Vlasov limit and to 
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understand the universal features of halo structures. 

An interesting direction of investigation is the one followed by Barre and Gongalves, that considers models on 
random graphs. Using the large deviation cavity method, analytical solutions of the Potts spins systems on a random 
fc-regular graph have been recently derived in both the canonical and microcanonical ensembles. The analytical 
solution, confirmed by numerical Metropolis and Creutz simulations, clearly demonstrate the presence of a region with 
negative specific heat and, consequently, ensemble inequivalence between canonical and microcanonical ensembles fill ]. 

An alternative method has been used to des crib e quasi-stationary states. These states, according to the method 
of non extensive statistics proposed by Tsallis [30 1\ , should be maximum entropy states of a generalized Boltzmann 
entropy defined as S q — ks (1 — y! T - V(<7 ~ where pi is the normalized probability of state i. As correctly 
pointed out by Tsallis and coworkers [1621 . |219I | , the presence of quasi-stationary states is tightly linked to the non- 
commutability of the infinite time limit with the thermodynamic limit for systems with long-range interactions. 
Velocity distributions that maximize Tsallis entropy S q are g-Gaussian defined by G q (x) = A(l — (1 — q)r)x 2 ) 1 ' 1 ~ q , 
where A is a normalization constant and rj c ontrols th e width of the distribution. These distributions do not convinc- 
ingly fit numerical data of the HMF model 162, 219]. Moreover, no theoretical approach has been proposed which 
leads to the derivation of such distributions for a specific case, like the HMF model. For this reason, this theory has 
not shown any predictive power in this field. 

Although unsuccessful to explain quasi-stationary states, Tsallis approach has led to the introduction of interesting 
models and concepts. A relevant model, that we also discuss in Sec. IVI B 21 is the a-HMF model, which has been 
introduced in Ref. Q to discuss the behavior of the maximal Lyapunov exponent in systems with long-range inter- 
actions: this is still an open and interesting question. Algebraic deca y of time correlatio ns and anomalous diffusion 
within the HMF model were discovered by Rapisarda and coworkers [2181 . 12631 1264| . 1265] . However, the approach to 
this problem developed in Sees. fVB 21 and [V B 31 is able to explain all numerical observations with out resorting to non 
extensive statistics. Moreover, although motivated by non extensive statistics, the work in Ref. |247l ] concludes that 
quasi-stationary states, ori ginal ly looked at by Rapisarda and coworkers, do not show anomalous diffusion, as also 
previously reported in Ref. [3l4 | . 



C. Perspectives 

The study of long-range interactions has certainly overcome the pioneering era where it has been important to 
become acquainted with basic facts and concepts that looked odd initially, like negative specific heat. Having passed 
to a more mature period, this field, besides strengthening its foundations, should incorporate methods and tools 
developed in other disciplines and look towards the realizations of well devised experiments that could lead to the 
observations of the effects predicted by the theory. 

From the mathematical point of view, one is faced to interesting and difficult problems for what concerns both 
equilibrium statistical mechanics and out-of-equilibrium dynamics. It would be important to develop large deviation 
methods in order to allow the treatment of systems with slowly decaying interactions as was done for instance for 



the a-Ising model 37[ and of systems with both short and long-range interactions. For what concerns the dynamical 
properties, several significant advances have been recently obtained in kinetic theory concerning the study of the 
solutions of Vlasov-Poisson and Vlasov-Maxwell systems. Both the difficulties of incorporating in the theory the 
singularity of the C oulom b or gravitational potential and the enlargement of the treatable initial conditions have been 
addressed (see Ref. |l29j |). 

When going to systems where the interaction is long-range, but not mean-field, the difficulty of performing numerical 
simulations is related to the fact that algorithms are intrinsically of order N 2 for each time step, where N is the number 
of pa rticles. Because of this, for gravitational systems, dedicated computers have been built (e.g. GRAPE in Japan 
[203j ]) and sophisticated numerical codes have been developed [l|. The issue of devising better algorithms is central 
for the improvements of the simulations and has not yet been seriously faced. 

We have contributed with this review to convey the idea that many of the effects that were previously att ributed t o 
gravitational systems appear also in other systems, as was early stressed in two-dimensional hydrodynamics [l09ll245| . 
We have already mentioned in Sec . IIIII two-dimensional hydrodynamics, two-dimensional elasticity, charged and dipolar 
systems, small systems. We would like to concentrate here on three physical examples that are more likely to lead in 
the near future to experimental realizations: the free electron laser, cold atoms and lattice dipolar systems. 

The cooperative effect leading to coherent emission of radiation in the Free Electron Laser (FEL) has been described 
in Sec. IIVE61 using an analogy with phase transitions. In a realistic setting, convergence to Boltzmann-Gibbs 
equilibrium will never occur because of the extremely elevated number of electrons, compared to the length of the 
wiggler. That's why it is highly probable that FELs will always operate in regions of parameters dominated by 
quasi-stationary states. As we know from the analysis of such states in the HMF model, they show phase transitions 
from homogeneous to inhomogeneous macroscopic distributions (see Sec. IV B4| . Such transitions, together with the 
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characterization of quasi-stationary states, could be the object of an experimental study with linear FELs [391 ] - 

Cold atomic systems offer a promising laboratory for testing the predictions of the statistical mechanics of systems 
with long-range interactions. These systems range from ionized atoms confined by an external potential which interact 
via unscreened Coulomb repulsion, to neutral atoms that interact by means of Van d er Waals type forces. Additionally, 
dipolar gases have been recently the subject of extensive experimental studies [246j . Initially, some attention had been 
paid to the possible realization of true gravitational- like classical interaction induced by external laser fields [254| . 
However, this hope has failed until now due to technical difficulties. When the wave-like behavior of matter can 
be neglected, leading to a semi-classical approximation, and laser light is far from resonance, a fully Hamiltonian 
(microcanonical) treatment of Bose-Einstein condensates is viable. Moreover, ensemble inequivale nce for bo sonic gases 
is reminiscent of the ensemble inequivalence encountered for long-range interacting systems [50l . l31ll . l32ll |. However, 
it has been later realized that this type of ensemble inequivalence is of a different nature and should be analyzed 
in itself. A potentially fruitful seed has recently emerged in the study of cold atoms in optical high-Q cavities. For 
these systems, the interaction between two atoms is independent of the distance, because they are coupled via the 
back-scattering of photons within the cavit y. The interaction can lead to collective instabilities and self-organization 
phenomena, known as CARL [53|, 1 1291 . l286l | . Models of the CARL phenomenon have strong similarities with those of 
FEL. Therefore, we expect that the study of phase transitions and of quasi-stationary states could be realized also 
for these laboratory systems. 

We finally argue that dipolar condensed matter systems should be seriously considered to check some features of 
long-range interactions. Indeed, it has been remarked since long time that composites like Ho RhB^ , where dipolar 
interactions dominate over Heisenberg exchange interactions, have a mean-field critical behavior [258| . More recently, 
such results have been extended to diluted models whi ch si mulate the effect of disorder, making the Shcrrington- 
Kirpatrick spin-glass behavior experimentally accessible (255| . The importance of long-range dipolar interactions has 
been also recognized for pyrochlore lattice structures, that show the strongly degenerate and frustrated "spin ice" 
phase [H. 

In the final section of this review, we have briefly mentioned experimental applications, because we think that 
theoretical results obtained in the study of systems with long-range interactions have nowadays reached a level of 
understanding that might allow, in the near future, the experimental verifications of curious effects like negative 
specific heat, ergodicity breaking, quasi-stationary states. 
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Appendix A: Proof of min-max inequality 

Let us consider the function of two variables f(x,y). Under quite general conditions on /, the following inequality 
holds 

supinf f(x, y) < inf sup/(a;, y) . (445) 

x v v x 

Let us give a sketchy proof of this property, which is used in the text to prove that microcanical entropy (|160[) is 
always smaller or equal than canonical entropy (|158[) . Let us denote by the extremum that satisfies the l.h.s. 

of ([415]) . then 



f{x\,yi) < f(xi,y) , Vy 



(446) 



108 



Similarly, the extremum that satisfies the r.h.s., denoted (0:2,2/2) satisfies 

/(z 2 ,y 2 )>/(x,j/ 2 ) , Vz. (447) 

Then, 

f(xi,Vi) < f(xi,V2) < f(x 2 ,y 2 ) , (448) 

which proves inequality (|445[) . 

Appendix B: Evaluation of the Laplace integral outside the analyticity strip 

The microcanonical partition function in formula (|162|) can be expressed, using the Laplace representation of the 
Dirac 5 function as follows 



1 f 

{l(e,N) = — dXe NXe Z(X,N), (449) 

27TI J jj_ ioo 



with fi > 0; this is the expression given in (|164|) . As explained in subsection IIV D 41 we divide the integral in (|449p 
in three intervals, defined by Xj < —S, —5 < Xi < 5 and A/ > S, respectively, with < 5 < A. Here we show that 
the contribution to the integral in A coming from values of Aj outside the strip, i.e. for values of Xj with |A/| > A, is 
exponentially small in N . 

Let us then consider first the value of Z(X,N) in the two external intervals, i.e., for |A/| > A. We have 

Z(/3 + iX I ,N) = ]T cxp{-/3ff({5 i })}cxp{-iA / ir({5 i })} . (450) 
{Si,...,S N } 

We see that this expression is proportional to the canonical expectation value (exp (— iXiH)}, that we expect to be 
exponentially small for large N. We confirm this expectation rewriting the last expression as 

Z(/3 + iXi, N) = N { deexp(-N[/3e + iXis- s(e)]) , (451) 



where we have used the expression of the canonical partition in terms of the microcanonical entropy as in (|6lj) . This 
is an integral with a large phase. For N going to infinity, its value will be determined by the value of the integrand 
for e equal to the integration extremes and to the values of the possible nonanalyticities of s(e), all denoted by £k 
(see, e.g., Ref. HU). We then have 

Z(f3 + iXj,N)~ A^Cfeexpt-Al/fe^A^-s^)]) , (452) 
fc 

where Ck are coefficients that could in principle be determined. It is clear that the successive integration over A/ 
in any one of the two external intervals of integration will then give a vanishing contribution, due to the very large 
oscillations. We are then left with 

n(s,N) / dXe NXe Z{X,N), (453) 

which is the first equality given in Eq. (I105D . 



Appendix C: Differentiability of the function </>(Ai, . . . , A n ) 

We give a proof that the function 0(Ai, . . . , A„), defined in Eq. (|193|) . is differentiable, for real values of the Xi, 
when the functions /Xfe(x) are given by sums of one-particle functions 



1 N 

JLk{x) = — Y^9k(qi,Pi) fc=l,...,n. (454) 



N 

i=l 
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In this case, the canonical partition function is given by 



Therefore we have 



Z(\i, . . . , A„) = dpdq exp 



k=l 



N 



0(Ai, . . . , A„) = -In dpdq exp 



^2^k9k(q,p) 



k=l 



(455) 



(456) 



In the range of A.; where the integral in Eq. (|456[) is defined, (j) is continuous. Besides, differentiating with respect to 
A, we have: 



(Ai, . . . , A„) 



J dpdq gi(q, p) exp 


n 

-^2^k9k(q,p) 

. k=l 


/ dpdq exp 


n 

-^2^k9k(q,p) 

. k=l 





(457) 



Also this function is continuous, under the only hypothesis that the expectation value of the observable gi(q,p) is 
finite for all the allowed values of the in the canonical ensemble defined by Z(X\, . . . , A„). 



Appendix D: Autocorrelation of the fluctuations of the one-particle density 

Using the definition of the Fourier transform and formula (|27ip , one gets 



(Sf (k,p,0)8f(k',p',0)) 



{ ^l 2V ^e-^ k ' e '\5f(9,p,0)Sf(9',p',0)) (458) 
2^ I 2^ e ~ l{kS+k ' S ' )N [{fd ( >Pi °) fd (0',P', 0)) - fo (p) fo ( P ')\ ■ (459) 



The expression of the discrete density function (|26ip leads then to 



1 / N 

(f d (9,p,0)f d (9', P ',0)) = — (9-6^6 (p- Pj )S{0 -^Sfp- ^)) 

+ J25(9-e j )5(p- Pj )5(9' -9i)5(p' - Pi ) 

N (f d (9,p, 0)) S (9 - 9') 5 (p - p') + N(N - l)/ 2 (0, 9,p, 9',p' 
fo ( P ) S (9 - 9') 5 (p - p') + f (p) fo (p 1 ) + h 2 (9,p, 6', P ', 0), 



1 

1 

N 



(460) 

(461) 
(462) 



where we used the following definition of the correlation function h 2 

f 2 (9 7 p,9',p',0) = {5{9-9 ] )5(p-p ] )5(9 l -9 l )5{p' -p.i)) (463) 
N 

= jj—j [fo (P) fo {p') + h 2 {6,p, 9',p')] . (464) 
Substituting expression (|462p in Eq. (|459| . we find 

(Sf(k,p,0)5f(k',p',0)) = e-^°6( P -p>)+ jf*^ £ Ne-W+ k '^h 2 (9,p,0>,p'i465) 

hip) x ^ , 1 x _ _ .. n „ „ ^ (466) 

(467) 



h,-k'S [P-P') + 7T- S k,-k' K k ,PiP') 
ZTT Z7T 



5k - 



fe.-fe' 
2tt 



[fo(p)S(p-p')+fx(k, P ,p')} 



In the passage from (|465p to (|466p . we have used the fact that h 2 depends only on the difference 9 — 9' . Besides, it 
decays rapidly to zero in a range (9 — 9') ~ 1/N, so that /i(k,p,p') is of order 1. 
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Appendix E: Derivation of the Fokker-Planck coefficients 



As mentioned in Sec. IV B 21 the derivation of the Fokker-Planck equation (|328[) is performed in the time range 
1 <C t <C N. Within this approximation, expressions (|329|) and (|330p are thus replaced by 



A{p,t) = -((P(t)-P(0))) p(0 )= p 
B(p,t) = ^(( P (t)-p(0)f) m=p . 



(468) 
(469) 



Looking at Eq. (|327j) . we see that, at the order 1/N, we need to compute the following averages 



(470) 



and 



1 / d 2 Sv dSv 



(471) 



for the first moment, and 



^ (^(*(0) +P(0)h,h)^(6(0) + P (0)t 2 ,t 2 ) 



(472) 



for the second one. The fact that the position and the momentum of the test-particle at time arc given is specified 
inside the dependence on 9 of the derivatives of Sv. The condition of given initial angle and momentum of the 
test-particle produces corrections of the averages in Eqs. (|470p . ()471|) and (|472|) . with respect to their unconditioned 
values. These corrections being of order 1/y/N, one gets that at order 1/N they have to be taken into account only 
for Eq. (|470[) . Note that the unconditioned value of that equation vanishes. 

Let us start with Eq. (|472[) . To compute it. it is necessary to rewrite Eq. (|313p as 



(Sv(k, Lj)5v(k', w')> = 27r 3 4,- fc < (4.i + 4.-i) 5 ^ + UJ '\ / dp' f ( P ')5(Lo - p'k) . 



D(w,k) 



(473) 



By the inverse Fourier-Laplace transform, we then have 



-^-2 y y 

{2tt) 2 J-^ 

v ' k — — oo k'—— o 



duj I dJ{-kk') e **(«(0)+P*O-*-*i 

oo 



J2 / dcjk 2 ^^-^ 1 -^ "*' 1 ' I dp' f (p')5(ui-p'k) (475) 

_ J — OO 



k— — oo 
+oo 



„2 p »(fcp-uQ(ti-t 2 ) flM + 4- i 
5(w,jfc) 



(474) 



dw ■ 



/oM 



e i(p-0))(«i-t 2 ) _|_ g— i(p-o<)(ti— f 2 ) 



(476) 



Equation (|327| shows that the second moment appearing in the right hand side of Eq. (|469j) is determined by the 
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integral of the last expression in t± and t-2 from to t. More precisely 



B(p,t) 



1 1 7T 

tN2 



t r +oo 



dti / dt 



did ■ 



JO J-oo 



11 f +ac / ( W 



l i 

In* 



tX 



D(w,l) 



D(w,l) 



ds / dto 



e i(p-w)(ti-t 2 ) _|_ g— t(p-w)(ti— 1 3 ) 



i(p-w)« _|_ e -»(p-ti))« 



i — >+oo 



JV" 



TX 



2tt / dcj 



/o(w) 



o Jo 
t 

2 i ds (i — s) 





3 »(p-ti))« _|_ e -i(p-w)a 



iV 



-2^ 



/o(p) 



2 



7T(J (p - 



Dip, 1) 



iV 



£>(p). 



which is expression (|332|) . 

For what concerns Eq. (|471[) . starting again from Eq. (|473[) . we obtain 



k— — oo k' 



(477) 
(478) 
(479) 
(480) 
(481) 



x e «='(e(o)+pt2)-io»'t a (^(^^^(ib'.w')) (482) 

if g/ + Lfc 3 e ^-)^-^) ^i±^ [dp'f (p')5(u;-p'k) (483) 
' fc) J 



fe=-c 
r+oo 

i- / dw 

z J-oo 



e i(p-w)(ti-t 2 ) e -i(P- w )(*i-*2) 



(484) 



£>(w,l) 

Equation (|327[) shows that the contribution Ai(p, t) to the first moment appearing in the right hand side of Eq. (|468|) 
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is given by 
A x (jp,t) 



1 f* f u f Ul I d 2 Sv dSv 

du I dui / du 2 (-^-(9(0) + pu 7 u)-^-(9(0) + pu 2} u 2j 



t 



o Jo 



— — — / du I dui du 

t N 2 ' 



1 1 



o Jo 



dto ■ 



e i(p-u)(u-u 2 ) _ e -i(p-w)(«-« 2 ) 



= 7rlm / duj 

t N 



/o(w) 



- du du 2 (u — u 2 ) e 
" Jo Jo 



i(p—ui)(u—U2) 



1 1 

tN' 



+ 00 

7tRg / du ■ 



D(u,l) 



duj Jo 



-— 7rRe / duj 
t N 



d / (w) 



1 1 

tN' 



+ 00 

7T / did 



d "\D{w,l) 
d / (w) 



X _ gi(p-^)u-M2 



9w 



sin p — wlit 
aw 

P — LO 



1 1 

tN' 



+ 00 

2tt / dcj 



/ 



d /o(w) 



9cj 



V 



D(w,l) 



1 

— 'i 

N 



d fotp) 



duj ■ 



1 



sm 



oo (p — U)) 

1 d 



dp 



\ 



Ndp 



D{p). 



(485) 
(486) 

(487) 

(488) 

(489) 

(490) 

(491) 

(492) 

(493) 



D( P ,l) 

For what concerns Eq. (|470p , let us rewrite Eq. (|305|1 as 



5v(k, lu) = —- 



tt(4,i+4,-i) , Sf(k,p',0) 



D{uj,k) 



i(p'k — lu) 



(494) 



where we have neglected the k = 0, that will not contribute. The condition on given initial angle and momentum of 
the test particle implies that 



Sf(k,p',0) 



1 1 

N2tt 



e -M(°)s(p-p>). 



We therefore have 



/ dSv 



(0(O)+pu,u)) = - 
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1 1 
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N 47T 

1 1 
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1 1 
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1 1 
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D(u},1) 



2tt 



3 fo(p) 



D(p, 1) 



(495) 

(496) 

(497) 
(498) 

(499) 
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From Eq. (|327[) , one sees that the contribution A2 (jp, t) of this term to the first moment appearing in the right hand 
side of Eq. (|468[) is given by 



t — >+oo 



1 1 1 

t N Att 


Jo 1 


1 ^ 

7T - 

N 2 


/o(p) 


Dip, 1) 2 


1 ^ 
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£(p, 1) 2 
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AT 2* " 
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+00 

dcj 



1 



111 

Re 



00 P-w 



ZT(u;,l)e l(p - w)u) 



2^ J 



/5(p) 



2 Re 7? 



do; ■ 



1 £>>,!) 



(p - 



. 2 p — LU 

2% sin h sm(p — c<j)i 



iV2 

1 1 

+ N2 n 



%1) 
/o(p) 



5 = iv 7r 



/5(p) 



11% 



D(p). 



(500) 
(501) 
(502) 
(503) 



Adding Eqs. (|493| for Ai(p,t) and (j503j) for A 2 (p,t) we obtain Eq. (|33i"|) for A(p,f). 
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